Speech processing in modern communication: challenges and perspectives
Cohen, Israel
Benesty, Jacob
Gannot, Sharon
Modern communication devices, such as mobile phones, teleconferencing systems, VoIP, etc., are often used in noisy and reverberant environments. Therefore,signals picked up by the microphones from telecommunication devices contain not only the desired near-end speech signal, but also interferences such as thebackground noise, far-end echoes produced by the loudspeaker, and reverberations of the desired source. These interferences degrade the fidelity and intelligibility of the near-end speech in human-to-human telecommunications and decrease the performance of human-to-machine interfaces (i.e., automatic speech recognition systems). The proposed book deals with the fundamental challenges ofspeech processing in modern communication, including speech enhancement, interference suppression, acoustic echo cancellation, relative transfer function identification, source localization, dereverberation, and beamforming in reverberant environments. Enhancement of speech signals is necessary whenever the source signal is corrupted by noise. In highly non-stationary noise environments, noise transients, and interferences may be extremely annoying. Acoustic echocancellation is used to eliminate the acoustic coupling between the loudspeaker and the microphone of a communication device. Identification of the relative transfer function between sensors in response to a desired speech signal enables to derive a reference noise signal for suppressing directional or coherent noise sources. Source localization, dereverberation, and beamforming in reverberant environments further enable to increase the intelligibility of the near-end speech signal. Delivers timely overview of the fundamental challenges inmodern speech communication systems Provides concise insights into recent research topics INDICE: Linear System Identification in the Short-Time Fourier Transform Domain.- Identification of the Relative Transfer Function between Sensors in the Short-Time Fourier Transform Domain.- Representation and Identification of Nonlinear Systems in the Short-Time Fourier Transform Domain.- Variable Step-Size Adaptive Filters for Echo Cancellation.- Simultaneous Detection and Estimation Approach for Speech Enhancement and Interference Suppression.- Speech Dereverberation and Denoising Based on Time Varying Speech Model and Autoregressive Reverberation Model.- Codebook Approaches for Single Sensor Speech/Music Separation.- Microphone Arrays: Fundamental Concepts.- The MVDR Beamformer for Speech Enhancement.- Extraction of Desired Speech Signals in Multiple-Speaker Reverberant Noisy Environments.- Spherical Microphone Array Beamforming.- Steered Beamforming Approaches for Acoustic Source Localization
- ISBN: 978-3-642-11129-7
- Editorial: Springer
- Encuadernacion: Cartoné
- Páginas: 342
- Fecha Publicación: 01/01/2010
- Nº Volúmenes: 1
- Idioma: Inglés