SP-P7: Speech Synthesis II |
Session Type: Poster |
Time: Wednesday, March 28, 16:30 - 18:30 |
Location: Poster Area A
|
Session Chairs: Frank Soong, Microsoft Research Asia and Tomoki Toda, Nara Institute of Science and Technology
|
|
SP-P7.1: COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH HIERARCHICAL LINEAR TRANSFORMATIONS
|
Lakshmi Saheer; IDIAP Research Institute / École Polytechnique Fédérale de Lausanne
|
Junichi Yamagishi; Centre for Speech Technology Research, University of Edinburgh
|
Philip N. Garner; IDIAP Research Institute
|
John Dines; IDIAP Research Institute
|
|
SP-P7.2: STATISTICAL APPROACH TO VOICE QUALITY CONTROL IN ESOPHAGEAL SPEECH ENHANCEMENT
|
Kenzo Yamamoto; Nara Institute of Science and Technology
|
Tomoki Toda; Nara Institute of Science and Technology
|
Hironori Doi; Nara Institute of Science and Technology
|
Hiroshi Saruwatari; Nara Institute of Science and Technology
|
Kiyohiro Shikano; Nara Institute of Science and Technology
|
|
SP-P7.3: CREATING SYNTHETIC VOICES FOR CHILDREN BY ADAPTING ADULT AVERAGE VOICE USING STACKED TRANSFORMATIONS AND VTLN
|
Reima Karhila; Aalto University
|
D.R. Sanand; Norwegian University of Science and Technology
|
Mikko Kurimo; Aalto University
|
Peter Smit; Aalto University
|
|
SP-P7.4: GAUSSIAN PROCESS DYNAMICAL MODELS FOR NONPARAMETRIC SPEECH REPRESENTATION AND SYNTHESIS
|
Gustav Eje Henter; KTH Royal Institute of Technology
|
Marcus R. Frean; Victoria University of Wellington
|
W. Bastiaan Kleijn; Victoria University of Wellington
|
|
SP-P7.5: TEMPLATE-BASED PERSONALIZED SINGING VOICE SYNTHESIS
|
Ling Cen; Institute for Infocomm Research (I²R), A*STAR, Singapore
|
Minghui Dong; Institute for Infocomm Research (I²R), A*STAR, Singapore
|
Paul Chan; Institute for Infocomm Research (I²R), A*STAR, Singapore
|
|
SP-P7.6: IMPROVED MINIMUM CONVERTED TRAJECTORY ERROR TRAINING FOR REAL-TIME SPEECH-TO-LIPS CONVERSION
|
Wei Han; Shanghai Jiao Tong University
|
Lijuan Wang; Microsoft Research Asia
|
Frank Soong; Microsoft Research Asia
|
Bo Yuan; Shanghai Jiao Tong University
|
|
SP-P7.7: LOCAL LINEAR TRANSFORMATION FOR VOICE CONVERSION
|
Victor Popa; Tampere University of Technology
|
Hanna Silen; Tampere University of Technology
|
Jani Nurminen; Nokia
|
Moncef Gabbouj; Tampere University of Technology
|
|
SP-P7.8: CROSS-LINGUAL FRAME SELECTION METHOD FOR POLYGLOT SPEECH SYNTHESIS
|
Chia-Ping Chen; National Sun Yat-Sen University
|
Yi-Chin Huang; National Cheng Kung University
|
Chung-Hsien Wu; National Cheng Kung University
|
Kuan-De Lee; National Cheng Kung University
|
|
SP-P7.9: EFFECT OF ANTI-ALIASING FILTERING ON THE QUALITY OF SPEECH FROM AN HMM-BASED SYNTHESIZER
|
Yoshinori Shiga; National Institute of Information and Communications Technology
|
|
SP-P7.10: HIGH QUALITY LIP-SYNC ANIMATION FOR 3D PHOTO-REALISTIC TALKING HEAD
|
Lijuan Wang; Microsoft Research Asia
|
Wei Han; Shanghai Jiao Tong University
|
Frank Soong; Microsoft Research Asia
|
|
SP-P7.11: TOWARDS AUTOMATIC PHONETIC SEGMENTATION FOR TTS
|
Asaf Rendel; IBM
|
Alexander Sorin; IBM
|
Ron Hoory; IBM
|
Andrew Breen; Nuance
|
|
SP-P7.12: A SMALL FOOTPRINT HYBRID STATISTICAL/UNIT SELECTION TEXT-TO-SPEECH SYNTHESIS SYSTEM FOR AGGLUTINATIVE LANGUAGES
|
Ekrem Guner; Ozyegin University
|
Cenk Demiroglu; Ozyegin University
|
|