SP-P16: Robust Speech Processing and Confidence Measures |
| Session Type: Poster |
| Time: Friday, March 30, 16:30 - 18:30 |
| Location: Poster Area B
|
| Session Chairs: Nitish Murthy, Texas Instruments and John H. L. Hansen, University of Texas at Dallas
|
| |
|
SP-P16.1: ARTIFICIAL STEREO DATA GENERATION FOR SPEECH FEATURE MAPPING
|
| Chang Woo Han; Seoul National University
|
| Tae Gyoon Kang; Seoul National University
|
| Shin Jae Kang; Seoul National University
|
| June Sig Sung; Seoul National University
|
| Nam Soo Kim; Seoul National University
|
| |
|
SP-P16.2: A TWO-MICROPHONE BASED VOICE ACTIVITY DETECTION FOR DISTANT-TALKING SPEECH IN WIDE RANGE OF DIRECTION OF ARRIVAL
|
| Yanmeng Guo; Chinese Academy of Sciences
|
| Kai Li; Chinese Academy of Sciences
|
| Qiang Fu; Chinese Academy of Sciences
|
| Yonghong Yan; Chinese Academy of Sciences
|
| |
|
SP-P16.3: ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
|
| Florian Müller; University of Lübeck
|
| Alfred Mertins; University of Lübeck
|
| |
|
SP-P16.4: INTEGRATION OF BEAMFORMING AND AUTOMATIC SPEECH RECOGNITION THROUGH PROPAGATION OF THE WIENER POSTERIOR
|
| Ramon F. Astudillo; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
| Alberto Abad; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
| Joao Neto; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
| |
|
SP-P16.5: TIME-VARYING RESIDUAL NOISE FEATURE MODEL ESTIMATION FOR MULTI-MICROPHONE SPEECH RECOGNITION
|
| Takuya Yoshioka; NTT Corporation
|
| Emmanuel Ternon; NTT Corporation
|
| Tomohiro Nakatani; NTT Corporation
|
| |
|
SP-P16.6: MUSIC MODELS FOR MUSIC-SPEECH SEPARATION
|
| Thad Hughes; Google, Inc.
|
| Trausti Kristjansson; Google, Inc.
|
| |
|
SP-P16.7: MODEL-BASED NOISE REDUCTION LEVERAGING FREQUENCY-WISE CONFIDENCE METRIC FOR IN-CAR SPEECH RECOGNITION
|
| Osamu Ichikawa; International Business Machines
|
| Steven Rennie; International Business Machines
|
| Takashi Fukuda; IBM Research - Tokyo
|
| Masafumi Nishimura; International Business Machines
|
| |
|
SP-P16.8: ERROR TYPE CLASSIFICATION AND WORD ACCURACY ESTIMATION USING ALIGNMENT FEATURES FROM WORD CONFUSION NETWORK
|
| Atsunori Ogawa; NTT Corporation
|
| Takaaki Hori; NTT Corporation
|
| Atsushi Nakamura; NTT Corporation
|
| |
|
SP-P16.9: TOWARDS A DOMAIN-INDEPENDENT ASR-CONFIDENCE CLASSIFIER
|
| Om D Deshmukh; IBM
|
| Etienne Marcheret; IBM
|
| Ashish Verma; IBM
|
| |
|
SP-P16.10: CRF-BASED CONFIDENCE MEASURES OF RECOGNIZED CANDIDATES FOR LATTICE-BASED AUDIO INDEXING
|
| Zhijian Ou; Tsinghua University
|
| Huaqing Luo; Tsinghua University
|
| |
|
SP-P16.11: CORPUS-INDEPENDENT HISTORY COMPRESSION FOR STOCHASTIC TURN-TAKING MODELS
|
| Kornel Laskowski; Carnegie Mellon University
|
| Elizabeth Shriberg; Microsoft Speech Labs
|
| |
|
SP-P16.12: PROFLIFELOG: ENVIRONMENTAL ANALYSIS AND KEYWORD RECOGNITION FOR NATURALISTIC DAILY AUDIO STREAMS
|
| Abhijeet Sangwan; Center for Robust Speech Systems (CRSS)
|
| Ali Ziaei; Center for Robust Speech Systems (CRSS)
|
| John H. L. Hansen; Center for Robust Speech Systems (CRSS)
|
| |