Sundar et al., 2012 - Google Patents

Identification of active sources in single-channel convolutive mixtures using known source models

Sundar et al., 2012

Document ID: 14970855798672495845
Author: Sundar H; Sreenivas T; Kellermann W
Publication year: 2012
Publication venue: IEEE Signal Processing Letters

External Links

Cited by

Snippet

We address the problem of identifying the constituent sources in a single-sensor mixture signal consisting of contributions from multiple simultaneously active sources. We propose a generic framework for mixture signal analysis based on a latent variable approach. The …

Continue reading at ieeexplore.ieee.org (other versions)

239000000203 mixture 0 title abstract description 81

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6261—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation partitioning the feature space
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Bando et al.	2018	Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization
Luo et al.	2018	Speaker-independent speech separation with deep attractor network
Gao et al.	2012	Unsupervised single-channel separation of nonstationary signals using gammatone filterbank and itakura–saito nonnegative matrix two-dimensional factorizations
Bando et al.	2021	Neural full-rank spatial covariance analysis for blind source separation
Richter et al.	2020	Speech Enhancement with Stochastic Temporal Convolutional Networks.
Adiloğlu et al.	2016	Variational Bayesian inference for source separation and robust feature extraction
JP5994639B2 (en)	2016-09-21	Sound section detection device, sound section detection method, and sound section detection program
Duong et al.	2018	Gaussian modeling-based multichannel audio source separation exploiting generic source spectral model
Bahari et al.	2017	Distributed multi-speaker voice activity detection for wireless acoustic sensor networks
Shashanka et al.	2007	Sparse overcomplete decomposition for single channel speaker separation
Sundar et al.	2012	Identification of active sources in single-channel convolutive mixtures using known source models
Doulaty et al.	2016	Automatic optimization of data perturbation distributions for multi-style training in speech recognition
WO2012105385A1 (en)	2012-08-09	Sound segment classification device, sound segment classification method, and sound segment classification program
Schmidt et al.	2007	Linear regression on sparse features for single-channel speech separation
JP5726790B2 (en)	2015-06-03	Sound source separation device, sound source separation method, and program
Du et al.	2021	Semi-supervised multichannel speech separation based on a phone-and speaker-aware deep generative model of speech spectrograms
Abdipour et al.	2015	Binaural source separation based on spatial cues and maximum likelihood model adaptation
WO2019194300A1 (en)	2019-10-10	Signal analysis device, signal analysis method, and signal analysis program
Subba Ramaiah et al.	2017	A novel approach for speaker diarization system using TMFCC parameterization and Lion optimization
Jafari et al.	2014	On the use of the Watson mixture model for clustering-based under-determined blind source separation.
Abbas et al.	2020	Enhancing Linear Independent Component Analysis: Comparison of Various Metaheuristic Methods.
Samui et al.	2017	Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines.
Zohny et al.	2014	Modelling interaural level and phase cues with Student's t-distribution for robust clustering in MESSL
Mirzaei et al.	2014	Blind audio source separation of stereo mixtures using bayesian non-negative matrix factorization
Chakraborty et al.	2012	Detection and positioning of overlapped sounds in a room environment