Kavalekalam et al., 2017 - Google Patents

Model based binaural enhancement of voiced and unvoiced speech

Kavalekalam et al., 2017

Document ID: 9796278269130800097
Author: Kavalekalam M; Christensen M; Boldt J
Publication year: 2017
Publication venue: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

External Links

Cited by

Snippet

This paper deals with the enhancement of speech in presence of non-stationary babble noise. A binaural speech enhancement framework is proposed which takes into account both the voiced and unvoiced speech production model. The usage of this model in …

Continue reading at vbn.aau.dk (PDF) (other versions)

238000004519 manufacturing process 0 abstract description 5

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building

Similar Documents

Publication	Publication Date	Title
Williamson et al.	2017	Time-frequency masking in the complex domain for speech dereverberation and denoising
Zhang et al.	2021	ADL-MVDR: All deep learning MVDR beamformer for target speech separation
Wang et al.	2020	Complex spectral mapping for single-and multi-channel speech enhancement and robust ASR
Zhao et al.	2018	Two-stage deep learning for noisy-reverberant speech enhancement
Taherian et al.	2020	Robust speaker recognition based on single-channel and multi-channel speech enhancement
Kuklasiński et al.	2016	Maximum likelihood PSD estimation for speech enhancement in reverberation and noise
Sadjadi et al.	2011	Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
Williamson et al.	2017	Speech dereverberation and denoising using complex ratio masks
Kavalekalam et al.	2018	Model-based speech enhancement for intelligibility improvement in binaural hearing aids
Hendriks et al.	2015	Optimal near-end speech intelligibility improvement incorporating additive noise and late reverberation under an approximation of the short-time SII
Luo	2022	A time-domain real-valued generalized wiener filter for multi-channel neural separation systems
Kavalekalam et al.	2016	Kalman filter for speech enhancement in cocktail party scenarios using a codebook-based approach
Dadvar et al.	2019	Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target
Goetze et al.	2014	Speech quality assessment for listening-room compensation
Westhausen et al.	2023	Low bit rate binaural link for improved ultra low-latency low-complexity multichannel speech enhancement in hearing aids
Kavalekalam et al.	2017	Model based binaural enhancement of voiced and unvoiced speech
Aroudi et al.	2020	Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Gerkmann	2011	Cepstral weighting for speech dereverberation without musical noise
Miyazaki et al.	2011	Theoretical analysis of parametric blind spatial subtraction array and its application to speech recognition performance prediction
Li et al.	2023	A composite t60 regression and classification approach for speech dereverberation
Kavalekalam et al.	2016	Binaural speech enhancement using a codebook based approach
Kavalekalam et al.	2019	Hearing aid-controlled beamformer for binaural speech enhancement using a model-based approach
Ji et al.	2017	Coherence-Based Dual-Channel Noise Reduction Algorithm in a Complex Noisy Environment.
Schwarz et al.	2012	On blocking matrix-based dereverberation for automatic speech recognition
Zahedi et al.	2020	A constrained maximum likelihood estimator of speech and noise spectra with application to multi-microphone noise reduction