Kodrasi et al., 2018 - Google Patents

Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation

Kodrasi et al., 2018

Document ID: 5034625945231284657
Author: Kodrasi I; Doclo S
Publication year: 2018
Publication venue: IEEE/ACM Transactions on Audio, Speech, and Language Processing

External Links

Cited by

Snippet

Many speech dereverberation techniques require an estimate of the late reverberation power spectral density (PSD). State-of-the-art multichannel methods for estimating the late reverberation PSD typically rely on first, an estimate of the relative transfer functions (RTFs) …

Continue reading at uol.de (PDF) (other versions)

238000000354 decomposition reaction 0 title abstract description 15

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/02—Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback

Similar Documents

Publication	Publication Date	Title
Kodrasi et al.	2018	Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation
Williamson et al.	2017	Time-frequency masking in the complex domain for speech dereverberation and denoising
Braun et al.	2018	Evaluation and comparison of late reverberation power spectral density estimators
Kuklasiński et al.	2016	Maximum likelihood PSD estimation for speech enhancement in reverberation and noise
Kinoshita et al.	2009	Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction
Hendriks et al.	2011	Noise correlation matrix estimation for multi-microphone speech enhancement
Krueger et al.	2010	Speech enhancement with a GSC-like structure employing eigenvector-based transfer function ratios estimation
Cauchi et al.	2019	Non-intrusive speech quality prediction using modulation energies and LSTM-network
Wang et al.	2015	Noise power spectral density estimation using MaxNSR blocking matrix
Kodrasi et al.	2016	Joint dereverberation and noise reduction based on acoustic multi-channel equalization
Braun et al.	2016	Online dereverberation for dynamic scenarios using a Kalman filter with an autoregressive model
Kuklasiński et al.	2014	Maximum likelihood based multi-channel isotropic reverberation reduction for hearing aids
Mohammadiha et al.	2015	Speech dereverberation using non-negative convolutive transfer function and spectro-temporal modeling
Marquardt et al.	2018	Interaural coherence preservation for binaural noise reduction using partial noise estimation and spectral postfiltering
Kodrasi et al.	2018	Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field
Zhao et al.	2015	Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction
Thuene et al.	2016	Maximum-likelihood approach to adaptive multichannel-Wiener postfiltering for wind-noise reduction
Kodrasi et al.	2017	EVD-based multi-channel dereverberation of a moving speaker using different RETF estimation methods
Kodrasi et al.	2017	Late reverberant power spectral density estimation based on an eigenvalue decomposition
Habets et al.	2018	Dereverberation
Li et al.	2019	Multichannel online dereverberation based on spectral magnitude inverse filtering
Mirabilii et al.	2019	Multi-channel wind noise reduction using the Corcos model
Hoang et al.	2022	Multichannel speech enhancement with own voice-based interfering speech suppression for hearing assistive devices
Mirabilii et al.	2020	Spatial coherence-aware multi-channel wind noise reduction
Tammen et al.	2019	Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares