Corey et al., 2018 - Google Patents

Relative transfer function estimation from speech keywords

Corey et al., 2018

Document ID: 6623428539236143063
Author: Corey R; Singer A
Publication year: 2018
Publication venue: International Conference on Latent Variable Analysis and Signal Separation

External Links

Cited by

Snippet

Far-field speech capture systems rely on microphone arrays to spatially filter sound, attenuating unwanted interference and noise and enhancing a speech signal of interest. To design effective spatial filters, we must first estimate the acoustic transfer functions between …

Continue reading at corey1.web.engr.illinois.edu (PDF) (other versions)

238000000034 method 0 abstract description 16

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones

Similar Documents

Publication	Publication Date	Title
US11354536B2 (en)	2022-06-07	Acoustic source separation systems
US9668066B1 (en)	2017-05-30	Blind source separation systems
Vu et al.	2010	Blind speech separation employing directional statistics in an expectation maximization framework
US9008329B1 (en)	2015-04-14	Noise reduction using multi-feature cluster tracker
US11257512B2 (en)	2022-02-22	Adaptive spatial VAD and time-frequency mask estimation for highly non-stationary noise sources
Schädler et al.	2015	Separable spectro-temporal Gabor filter bank features: Reducing the complexity of robust features for automatic speech recognition
Mohammadiha et al.	2015	Speech dereverberation using non-negative convolutive transfer function and spectro-temporal modeling
Koldovský et al.	2013	Semi-blind noise extraction using partially known position of the target source
Jarrett et al.	2014	Noise reduction in the spherical harmonic domain using a tradeoff beamformer and narrowband DOA estimates
Zhao et al.	2015	Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction
Habets et al.	2018	Dereverberation
Li et al.	2019	Speech enhancement algorithm based on sound source localization and scene matching for binaural digital hearing aids
Corey et al.	2018	Relative transfer function estimation from speech keywords
Li et al.	2018	Multichannel identification and nonnegative equalization for dereverberation and noise reduction based on convolutive transfer function
Astapov et al.	2018	Far field speech enhancement at low SNR in presence of nonstationary noise based on spectral masking and MVDR beamforming
Čmejla et al.	2018	Independent vector analysis exploiting pre-learned banks of relative transfer functions for assumed target’s positions
Al-Karawi et al.	2024	The effects of distance and reverberation time on speaker recognition performance
Fontaine et al.	2018	Multichannel audio modeling with elliptically stable tensor decomposition
Hammer et al.	2020	FCN approach for dynamically locating multiple speakers
Prasanna Kumar et al.	2015	Supervised and unsupervised separation of convolutive speech mixtures using f 0 and formant frequencies
Salishev et al.	2017	Microphone array post-filter in frequency domain for speech recognition using short-time log-spectral amplitude estimator and spectral harmonic/noise classifier
Thakallapalli et al.	2020	Spectral features derived from single frequency filter for multispeaker localization
Chen et al.	2016	Localization of sound sources with known statistics in the presence of interferers
Venkatesan et al.	2020	Analysis of monaural and binaural statistical properties for the estimation of distance of a target speaker
Di Persia et al.	2011	Correlated postfiltering and mutual information in pseudoanechoic model based blind source separation