Technical Program

Note: Times and locations are subject to change. A Paper Search function is also available.

Sunday, March 25
13:30 - 16:30
Tutorial: T1: The Voice Behind the Speech: Speaker States, Traits, and Vocal Behavior Room G
Tutorial: T2: Speech Modeling and Enhancement Using Diffusion Maps Room C-2
Tutorial: T3: Signal Processing Meets Network Layers: Joint Protocol-Channel Decoding Room H
Tutorial: T4: Learning in the Context of Set Theoretic Estimation: An Efficient and Unifying Framework for Adaptive Machine Learning and Signal Processing Room J
Tutorial: T5: Introduction to Business Analytics Room K
Workshop: SpeechOcean Room G
Monday, March 26
09:30 - 12:30
Tutorial: T6: Reverberant Speech Processing for Human Communication and Automatic Speech Recognition Room C-2
Tutorial: T7: Distributed Data Fusion for Interactive Cognitive Environments Room H
Tutorial: T8: Very Large MIMO Systems Room C-1
Tutorial: T9: Deep Learning and Its Applications in Signal Processing Room D
Tutorial: T13: Biological Signal Processing and Molecular Network Informatics Room K
09:30 - 17:00
Workshop: Mathworks Room I
Workshop: Texas Instruments Room F
14:00 - 17:00
Tutorial: T11: Convex and Non-convex Approaches for Low-dimensional Models Room C-2
Tutorial: T12: Bio-Inspired Cognition, Adaptation, and Learning over Networks Room H
Tutorial: T10: Bayesian Learning for Speech and Language Processing Room D
Tutorial: T14: Fourier and Wavelet Signal Processing: Teaching SP with Geometry Room C-1
Tutorial: T15: 3D Video Coding & Distribution Room K
Workshop: Nuance Room G
17:30 - 19:30
Welcome Reception Sakura, Swan, Main Lounge, Cocktail Lounge
Tuesday, March 27
09:30 - 11:30
Opening Ceremony, Awards Ceremony Main Hall
11:30 - 12:30
Plenary: Karlheinz Brandenburg (Technische Universitat Ilmenau, Germany): Audio and Acoustics Signal Processing: the Quest for High Fidelity continues Main Hall
14:00 - 16:00
SP-L1: Speech Analysis I Room B-1
AASP-L1: Loudspeaker and Microphone Array Signal Processing Room B-2
Special Session: SS-L1: Resource-aware design of multiple radar systems Room E
IVMSP-L1: Image Coding Room D
SPCOM-L1: Beamforming and MIMO Room C-1
SPTM-L1: Detection Theory and Methods Room C-2
SP-P1: Speaker ID I Poster Area A
SP-P2: Machine Learning and Acoustic Modeling Poster Area B
AASP-P1: Perception and Echo Cancellation Poster Area C
SLP-P1: Language Modeling Poster Area D
IVMSP-P1: Video Segmentation and Tracking Poster Area E
SPCOM-P1: Distributed Optimization and Resource Allocation Poster Area F
SPTM-P1: Joint SPTM/SPCOM Session: Sampling, Sparsity and Reconstruction I Poster Area G
BISP-P1: Biomedical Signal Processing I Poster Area H
16:30 - 18:30
SP-L2: Speech Synthesis I Room B-1
AASP-L2: Echo Cancellation Room B-2
Special Session: SS-L2: Distributed Transmit Beamforming Room E
IVMSP-L2: Depth Estimation Room D
SPCOM-L2: Resource Allocation and Interference Management Room C-1
SPTM-L2: Adaptive Filtering Room C-2
SP-P3: Adaptation in ASR Poster Area A
MLSP-P1: Applications in Audio and Speech Processing Poster Area B
AASP-P2: Loudspeaker and Microphone Array Signal Processing Poster Area C
IVMSP-P2: Image and Video Retrieval Poster Area D
IVMSP-P3: Image Denoising, Restoration, and Enhancement Poster Area E
SPCOM-P2: Sampling, Coding and Modulation Poster Area F
SPTM-P2: Filter Banks and Filter Design Poster Area G
IFS-P1: Watermarking and data hiding Poster Area H
Wednesday, March 28
09:00 - 10:00
Plenary: Chin-Hui Lee (Georgia Institute of Technology, USA): From Signal Processing to Information Extraction of Speech: A New Perspective on Automatic Speech Recognition Main Hall
10:30 - 12:30
Special Session: SS-L3: Large-Scale Optimization for Signal Processing and Speech Recognition Room B-1
SLP-L1: Speech Translation Room B-2
Special Session: SS-L4: Controlled Sensing for Inference Room E
IVMSP-L3: Video Coding I Room D
SPCOM-L3: Sensor Networks Room C-1
SPTM-L3: Compressed Sensing and Sparsity I Room C-2
SP-P4: Speaker ID II Poster Area A
SP-P5: LVCSR Poster Area B
AASP-P3: Source Separation Poster Area C
MMSP-P1: Multimedia Communication and Networking Poster Area D
IVMSP-P4: Image Filtering Poster Area E
SPCOM-P3: Communication Systems I Poster Area F
SPTM-P3: Estimation Methods and Performance Bounds Poster Area G
BISP-P2: Electro- and magnetoencephalography Poster Area H
ST-1.1: Simple approach to assessing penile endothelial function in young individuals Annex Hall
ST-1.2: Multiscale Entropy Analysis of Pulse Wave Velocity for Assessing Atherosclerosis Annex Hall
14:00 - 16:00
SP-L3: Speech Enhancement I Room B-1
AASP-L3: Source Separation: Music and Speech Room B-2
Special Session: SS-L5: Digitally Enhanced Analog Systems Room E
IVMSP-L4: Interpolation and Super-resolution Room D
SPCOM-L4: Sparse Signal Processing for Communications and Networking Room C-1
SPTM-L4: Compressed Sensing and Sparsity II Room C-2
SP-P6: Speech Analysis II Poster Area A
MLSP-P2: Learning Theory Poster Area B
AASP-P4: Noise Reduction and Source Separation Poster Area C
SLP-P2: Spoken Language Understanding Poster Area D
IVMSP-P5: Image Segmentation and Quality Assessment Poster Area E
SAM-P1: Applications of Sensor Array and Multichannel Signal Processing Poster Area F
SPTM-P4: Adaptive Filtering and Nonlinear Systems Poster Area G
IFS-P2: Multimedia identification and authentication Poster Area H
ST-2.1: Compressed Sensing Prototype for Flexible Wireless System Annex Hall
ST-2.3: Configurable 3D Audio and Music Annex Hall
16:30 - 18:30
SP-L4: Speech Enhancement II Room B-1
AASP-L4: Music: Classification and Recognition Room B-2
Special Session: SS-L6: Analog Implementation Issues of Analog to Information Systems Room E
IVMSP-L5: Image Segmentation Room D
SPCOM-L5: Relay Networks Room C-1
DISPS-L1: Parallel and embedded signal processing systems Room C-2
SP-P7: Speech Synthesis II Poster Area A
MLSP-P3: Classification and Clustering Poster Area B
ITT-P1: Technology to Practice for Signal Processing Poster Area C
SLP-P3: Paralinguistic, Nonlinguistic Information and Data Mining Poster Area D
IVMSP-P6: Video Coding II Poster Area E
SPCOM-P4: Social Networks, Smart Grid and Other Emerging Applications Poster Area F
SPTM-P5: Compressed Sensing and Sparsity III Poster Area G
IFS-P3: Biometrics and network security Poster Area H
19:00 - 21:30
Banquet Grand Prince Hotel Kyoto 
Thursday, March 29
09:00 - 10:00
Plenary: Stephane Mallat (Ecole Polytechnique, France): Can Signal Classification Speak Mathematics? Main Hall
10:30 - 12:30
SP-L5: Acoustic Modeling I Room B-1
MLSP-L1: Blind Signal Separation and Matrix Factorizations Room B-2
Special Session: SS-L7: Signal and Information Processing for 'Big Data' Room E
IVMSP-L6: Image Analysis I Room D
SAM-L1: Signal Separation Room C-1
MMSP-L1: Multimedia Security and Forensics Room C-2
SP-P8: Speech Enhancement III Poster Area A
SP-P9: Pitch, Prosody, and Voice Quality Poster Area B
SPED-P1: Signal Processing Education Poster Area C
DISPS-P1: Communication, error correction, and navigation systems Poster Area D
IVMSP-P7: Remote Sensing Poster Area E
SPCOM-P5: Cognitive Radio and Sensor Networks Poster Area F
SPTM-P6: Joint SPTM/SPCOM Session: Sampling Sparsity and Reconstruction II Poster Area G
SAM-P2: DOA Estimation and Beamforming Poster Area H
ST-3.1: Real-Time Noise Reduction for Dual-Microphone Mobile Phones Annex Hall
ST-3.2: Real-time audio-visual meeting recognition and understanding using distant microphone array Annex Hall
ST-3.3: A Microphone Array for Tablet Computers and Smart Phones Annex Hall
13:00 - 16:00
Workshop: ICASSP2012 Room I
14:00 - 16:00
SP-L6: Robust ASR I Room B-1
SLP-L2: Spoken and Multimodal Dialog Systems and Applications Room B-2
Special Session: SS-L8: Learning with Music Signal Room E
IVMSP-L7: Image Enhancement Room D
SAM-L2: Detection and Estimation Room C-1
ITT-L1: Emerging Applications and Industry Technologies Room C-2
SP-P10: Speech Enhancement IV Poster Area A
MLSP-P4: Learning Theory and Models Poster Area B
AASP-P5: Audio Analysis and Synthesis Poster Area C
MMSP-P2: Multimedia Recognition, Search, and Retrieval Poster Area D
IVMSP-P8: Image Resizing and Reconstruction Poster Area E
SPCOM-P6: Communication Systems II Poster Area F
SPTM-P7: Signal and System Modeling and Estimation I Poster Area G
BISP-P3: Biomedical Image Processing Poster Area H
ST-4.1: Haptic Voice Recognition: A Robust Multimodal Interface for Mobile Devices Annex Hall
ST-4.2: Hanna's tourist guide for Kyoto Annex Hall
ST-4.3: Network-based Speech-to-Speech Translation for Multiparty Chat Annex Hall
16:30 - 18:30
SP-L7: Acoustic Modeling III Room B-1
AASP-L5: Source Separation and Signal Enhancement Room B-2
Special Session: SS-L9: Advances in singing-voice synthesis, transformation, and application Room E
IFS-L1: Multimedia forensics Room D
SAM-L3: MIMO Radar Room C-1
SPTM-L5: Estimation Methods and Applications I Room C-2
SP-P11: Robust ASR II Poster Area A
MLSP-P5: Bayesian, Information-theoretic and Graphical Learning Poster Area B
AASP-P6: Spatial Audio and Audio Coding Poster Area C
MMSP-P3: Audiovisual and Multimodal Processing Poster Area D
IVMSP-P9: Feature Extraction and Analysis Poster Area E
SPCOM-P7: Joint SPCOM/SAM Session: Communication Networks Poster Area F
SPTM-P8: Joint SPTM/SPCOM Session: Adaptive Filtering Poster Area G
BISP-P4: Biomedical Signal Processing II Poster Area H
Friday, March 30
09:00 - 10:00
Plenary: Mitsuo Kawato (ATR Computational Neuroscience Laboratories, Japan): Computational Neuroscience, Brain Decoding and Neurofeedback Main Hall
10:30 - 12:30
SP-L8: Alternative ASR Methods Room B-1
MLSP-L2: Learning Theory Room B-2
Special Session: SS-L10: Analysis Sparsity Room E
IFS-L2: Watermarking Room D
SAM-L4: Joint SAM/SPCOM Session: Relay-Assisted Communication Room C-1
SPTM-L6: Classification and Pattern Recognition Room C-2
SP-P12: Acoustic Modeling II Poster Area A
SP-P13: Speaker Verification Poster Area B
AASP-P7: Music: Classification and Recognition Poster Area C
SLP-P4: Speech Retrieval Poster Area D
IVMSP-P10: Image and Video Coding and Transmission Poster Area E
SAM-P3: Source Localization and Tracking Poster Area F
SPTM-P9: Sampling and Reconstruction Poster Area G
DISPS-P2: Algorithm and architecture optimization for signal processing Poster Area H
14:00 - 16:00
SP-L9: Speaker Diarization Room B-1
MLSP-L3: Applications in Audio, Speech, and Image Processing Room B-2
Special Session: SS-L11: Signal-Processing Challenges and Opportunities in Depth Cameras Room E
IVMSP-L8: Image Feature Extraction Room D
BISP-L1: Biomedical Imaging Room C-1
MMSP-L2: Joint Audio Visual Processing Room C-2
SP-P14: General Topics in Speech Recognition Poster Area A
MLSP-P6: Applications in Image Processing and Biomedicine Poster Area B
AASP-P8: Content Analysis for Music, Multimedia, and Medicine Poster Area C
IVMSP-P11: Image and Video Analysis, Processing, and Recognition Poster Area D
IVMSP-P12: 3-D Processing and Coding Poster Area E
SAM-P4: Sensor Networks and Distributed Estimation Poster Area F
SPTM-P10: Estimation Methods and Applications II Poster Area G
SPTM-P11: Signal and System Modeling and Estimation II Poster Area H
16:30 - 18:30
SP-L10: Weighted Finite-State Transducers Room B-1
AASP-L6: Music: Transcription, Separation and Transformations Room B-2
Special Session: SS-L12: Ray and Sound Reproducing in 3D Space Room E
IVMSP-L9: Visual Search and Annotation Room D
BISP-L2: Biomedical Signal Processing Room C-1
SPTM-L7: Time-Frequency Analysis and Applications Room C-2
SP-P15: Multilingual ASR and Language ID Poster Area A
SP-P16: Robust Speech Processing and Confidence Measures Poster Area B
AASP-P9: Room Acoustics and Acoustic System Modeling Poster Area C
IVMSP-P13: Motion Estimation, Registration, and Tracking Poster Area D
IVMSP-P14: Image and Video Modeling, Biometrics, and Applications Poster Area E
SAM-P5: Joint SAM/SPTM Session: Compressed Sensing and Sparse Signal Modeling Poster Area F
SPTM-P12: Detection and Estimation Theory and Methods Poster Area G