Kim, 2010 - Google Patents

Interference suppression using principal subspace modification in multichannel Wiener filter and its application to speech recognition

Document ID: 7848294150140356897
Author: Kim G
Publication year: 2010
Publication venue: ETRI Journal

Snippet

It has been shown that the principal subspace‐based multichannel Wiener filter (MWF) provides better performance than the conventional MWF for suppressing interference in the case of a single target source. It can efficiently estimate the target speech component in the …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K11/00—Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/16—Methods or devices for protecting against, or damping of, acoustic waves, e.g. sound
- G10K11/175—Methods or devices for protecting against, or damping of, acoustic waves, e.g. sound using interference effects; Masking sound
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building

Similar Documents

Publication	Publication Date	Title
Doclo et al.	2002	GSVD-based optimal filtering for single and multimicrophone speech enhancement
Cauchi et al.	2015	Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech
Gannot et al.	2008	Adaptive beamforming and postfiltering
US5574824A (en)	1996-11-12	Analysis/synthesis-based microphone array speech enhancer with variable signal distortion
Krueger et al.	2010	Speech enhancement with a GSC-like structure employing eigenvector-based transfer function ratios estimation
Xiao et al.	2016	Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation
DEREVERBERATION et al.	2014	REVERB Workshop 2014
Nakatani et al.	2019	Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation
Nesta et al.	2013	A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
Nesta et al.	2013	Blind source extraction for robust speech recognition in multisource noisy environments
Zhao et al.	2015	Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction
Song et al.	2021	An integrated multi-channel approach for joint noise reduction and dereverberation
Zhang et al.	2014	Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation
Yousefian et al.	2009	Using power level difference for near field dual-microphone speech enhancement
Fischer et al.	2018	Robust constrained MFMVDR filtering for single-microphone speech enhancement
Hashemgeloogerdi et al.	2020	Joint beamforming and reverberation cancellation using a constrained Kalman filter with multichannel linear prediction
Huang et al.	2008	Dereverberation
Heitkaemper et al.	2018	Smoothing along frequency in online neural network supported acoustic beamforming
Li et al.	2018	Multichannel identification and nonnegative equalization for dereverberation and noise reduction based on convolutive transfer function
Sehr et al.	2008	Towards robust distant-talking automatic speech recognition in reverberant environments
Adcock	2001	Optimal filtering and speech recognition with microphone arrays
Delcroix et al.	2017	Multichannel speech enhancement approaches to DNN-based far-field speech recognition
Nakatani et al.	2019	Simultaneous denoising, dereverberation, and source separation using a unified convolutional beamformer
Kim	2010	Interference suppression using principal subspace modification in multichannel Wiener filter and its application to speech recognition
Chodingala et al.	2022	Robustness of DAS Beamformer Over MVDR for Replay Attack Detection On Voice Assistants