SP-P11: Robust ASR II |
| Session Type: Poster |
| Time: Thursday, March 29, 16:30 - 18:30 |
| Location: Poster Area A
|
| Session Chairs: W. Bastiaan Kleijn, Victoria University of Wellington and Satoru Hayamizu, Gifu University
|
| |
|
SP-P11.1: FACTOR ANALYSIS BASED VTS DISCRIMINATIVE ADAPTIVE TRAINING
|
| Federico Flego; Cambridge University
|
| Mark Gales; University of Cambridge
|
| |
|
SP-P11.2: COMBINING EIGENVOICE SPEAKER MODELING AND VTS-BASED ENVIRONMENT COMPENSATION FOR ROBUST SPEECH RECOGNITION
|
| Zhijian Ou; Tsinghua University
|
| Kan Deng; Tsinghua University
|
| |
|
SP-P11.3: IMPROVEMENTS TO VTS FEATURE ENHANCEMENT
|
| Jinyu Li; Microsoft Corporation
|
| Michael Seltzer; Microsoft Corporation
|
| Yifan Gong; Microsoft Corporation
|
| |
|
SP-P11.4: NON-NEGATIVE MATRIX FACTORIZATION FOR HIGHLY NOISE-ROBUST ASR: TO ENHANCE OR TO RECOGNIZE?
|
| Felix Weninger; Technische Universität München
|
| Martin Wöllmer; Technische Universität München
|
| Jürgen Geiger; Technische Universität München
|
| Björn Schuller; Technische Universität München
|
| Jort Gemmeke; Katholieke Universiteit Leuven
|
| Antti Hurmalainen; Tampere University of Technology
|
| Tuomas Virtanen; Tampere University of Technology
|
| Gerhard Rigoll; Technische Universität München
|
| |
|
SP-P11.5: ASR-DRIVEN TOP-DOWN BINARY MASK ESTIMATION USING SPECTRAL PRIORS
|
| William Hartmann; The Ohio State University
|
| Eric Fosler-Lussier; The Ohio State University
|
| |
|
SP-P11.6: TWO-DIMENSIONAL FRAME-AND-FEATURE WEIGHTED VITERBI DECODING FOR ROBUST SPEECH RECOGNITION
|
| Chang Yang; National Taiwan University
|
| Lee Lin Shan; National Taiwan University
|
| |
|
SP-P11.7: COMBINING MISSING-DATA RECONSTRUCTION AND UNCERTAINTY DECODING FOR ROBUST SPEECH RECOGNITION
|
| Jose Andres Gonzalez Lopez; University of Granada
|
| Antonio Miguel Peinado Herreros; University of Granada
|
| Angel Manuel Gomez Garcia; University of Granada
|
| Ning Ma; University of Sheffield
|
| Jon Barker; University of Sheffield
|
| |
|
SP-P11.8: HISTOGRAM-BASED SUBBAND POWER WARPING AND SPECTRAL AVERAGING FOR ROBUST SPEECH RECOGNITION UNDER MATCHED AND MULTISTYLE TRAINING
|
| Mark Harvilla; Carnegie Mellon University
|
| Richard Stern; Carnegie Mellon University
|
| |
|
SP-P11.9: A COMPARISON OF FRONT-END COMPENSATION STRATEGIES FOR ROBUST LVCSR UNDER ROOM REVERBERATION AND INCREASED VOCAL EFFORT
|
| Seyed Omid Sadjadi; The University of Texas at Dallas
|
| Hynek Boril; The University of Texas at Dallas
|
| John H. L. Hansen; The University of Texas at Dallas
|
| |
|
SP-P11.10: STEREO-BASED STOCHASTIC MAPPING WITH CONTEXT USING PROBABILISTIC PCA FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
|
| Xiaodong Cui; IBM T.J. Watson Research Center
|
| Mohamed Afify; Orange Labs, France Telecom
|
| Bowen Zhou; IBM T.J. Watson Research Center
|
| |
|
SP-P11.11: NOISE AND SPEAKER COMPENSATION IN THE LOG FILTER BANK DOMAIN
|
| Vikas Joshi; Indian Institute of Technology, Madras
|
| Raghavendra Bilgi; Indian Institute of Technology, Madras
|
| Umesh Srinivasan; Indian Institute of Technology, Madras
|
| Luz Garcia Martinez; Universidad de Granada
|
| Carmen Benítez Ortúzar; University of Granada
|
| |
|
SP-P11.12: NOISE SUPPRESSION WITH UNSUPERVISED JOINT SPEAKER ADAPTATION AND NOISE MIXTURE MODEL ESTIMATION
|
| Masakiyo Fujimoto; NTT Communication Science Laboratories, NTT Corporation
|
| Shinji Watanabe; NTT Communication Science Laboratories, NTT Corporation
|
| Tomohiro Nakatani; NTT Communication Science Laboratories, NTT Corporation
|
| |