SP-P16: Robust Speech Processing and Confidence Measures |
Session Type: Poster |
Time: Friday, March 30, 16:30 - 18:30 |
Location: Poster Area B
|
Session Chairs: Nitish Murthy, Texas Instruments and John H. L. Hansen, University of Texas at Dallas
|
|
SP-P16.1: ARTIFICIAL STEREO DATA GENERATION FOR SPEECH FEATURE MAPPING
|
Chang Woo Han; Seoul National University
|
Tae Gyoon Kang; Seoul National University
|
Shin Jae Kang; Seoul National University
|
June Sig Sung; Seoul National University
|
Nam Soo Kim; Seoul National University
|
|
SP-P16.2: A TWO-MICROPHONE BASED VOICE ACTIVITY DETECTION FOR DISTANT-TALKING SPEECH IN WIDE RANGE OF DIRECTION OF ARRIVAL
|
Yanmeng Guo; Chinese Academy of Sciences
|
Kai Li; Chinese Academy of Sciences
|
Qiang Fu; Chinese Academy of Sciences
|
Yonghong Yan; Chinese Academy of Sciences
|
|
SP-P16.3: ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
|
Florian Müller; University of Lübeck
|
Alfred Mertins; University of Lübeck
|
|
SP-P16.4: INTEGRATION OF BEAMFORMING AND AUTOMATIC SPEECH RECOGNITION THROUGH PROPAGATION OF THE WIENER POSTERIOR
|
Ramon F. Astudillo; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
Alberto Abad; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
Joao Neto; Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa
|
|
SP-P16.5: TIME-VARYING RESIDUAL NOISE FEATURE MODEL ESTIMATION FOR MULTI-MICROPHONE SPEECH RECOGNITION
|
Takuya Yoshioka; NTT Corporation
|
Emmanuel Ternon; NTT Corporation
|
Tomohiro Nakatani; NTT Corporation
|
|
SP-P16.6: MUSIC MODELS FOR MUSIC-SPEECH SEPARATION
|
Thad Hughes; Google, Inc.
|
Trausti Kristjansson; Google, Inc.
|
|
SP-P16.7: MODEL-BASED NOISE REDUCTION LEVERAGING FREQUENCY-WISE CONFIDENCE METRIC FOR IN-CAR SPEECH RECOGNITION
|
Osamu Ichikawa; International Business Machines
|
Steven Rennie; International Business Machines
|
Takashi Fukuda; IBM Research - Tokyo
|
Masafumi Nishimura; International Business Machines
|
|
SP-P16.8: ERROR TYPE CLASSIFICATION AND WORD ACCURACY ESTIMATION USING ALIGNMENT FEATURES FROM WORD CONFUSION NETWORK
|
Atsunori Ogawa; NTT Corporation
|
Takaaki Hori; NTT Corporation
|
Atsushi Nakamura; NTT Corporation
|
|
SP-P16.9: TOWARDS A DOMAIN-INDEPENDENT ASR-CONFIDENCE CLASSIFIER
|
Om D Deshmukh; IBM
|
Etienne Marcheret; IBM
|
Ashish Verma; IBM
|
|
SP-P16.10: CRF-BASED CONFIDENCE MEASURES OF RECOGNIZED CANDIDATES FOR LATTICE-BASED AUDIO INDEXING
|
Zhijian Ou; Tsinghua University
|
Huaqing Luo; Tsinghua University
|
|
SP-P16.11: CORPUS-INDEPENDENT HISTORY COMPRESSION FOR STOCHASTIC TURN-TAKING MODELS
|
Kornel Laskowski; Carnegie Mellon University
|
Elizabeth Shriberg; Microsoft Speech Labs
|
|
SP-P16.12: PROFLIFELOG: ENVIRONMENTAL ANALYSIS AND KEYWORD RECOGNITION FOR NATURALISTIC DAILY AUDIO STREAMS
|
Abhijeet Sangwan; Center for Robust Speech Systems (CRSS)
|
Ali Ziaei; Center for Robust Speech Systems (CRSS)
|
John H. L. Hansen; Center for Robust Speech Systems (CRSS)
|
|