Sundar et al., 2012 - Google Patents
Identification of active sources in single-channel convolutive mixtures using known source modelsSundar et al., 2012
- Document ID
- 14970855798672495845
- Author
- Sundar H
- Sreenivas T
- Kellermann W
- Publication year
- Publication venue
- IEEE Signal Processing Letters
External Links
Snippet
We address the problem of identifying the constituent sources in a single-sensor mixture signal consisting of contributions from multiple simultaneously active sources. We propose a generic framework for mixture signal analysis based on a latent variable approach. The …
- 239000000203 mixture 0 title abstract description 81
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6261—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation partitioning the feature space
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bando et al. | Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization | |
Luo et al. | Speaker-independent speech separation with deep attractor network | |
Gao et al. | Unsupervised single-channel separation of nonstationary signals using gammatone filterbank and itakura–saito nonnegative matrix two-dimensional factorizations | |
Bando et al. | Neural full-rank spatial covariance analysis for blind source separation | |
Richter et al. | Speech Enhancement with Stochastic Temporal Convolutional Networks. | |
Adiloğlu et al. | Variational Bayesian inference for source separation and robust feature extraction | |
JP5994639B2 (en) | Sound section detection device, sound section detection method, and sound section detection program | |
Duong et al. | Gaussian modeling-based multichannel audio source separation exploiting generic source spectral model | |
Bahari et al. | Distributed multi-speaker voice activity detection for wireless acoustic sensor networks | |
Shashanka et al. | Sparse overcomplete decomposition for single channel speaker separation | |
Sundar et al. | Identification of active sources in single-channel convolutive mixtures using known source models | |
Doulaty et al. | Automatic optimization of data perturbation distributions for multi-style training in speech recognition | |
WO2012105385A1 (en) | Sound segment classification device, sound segment classification method, and sound segment classification program | |
Schmidt et al. | Linear regression on sparse features for single-channel speech separation | |
JP5726790B2 (en) | Sound source separation device, sound source separation method, and program | |
Du et al. | Semi-supervised multichannel speech separation based on a phone-and speaker-aware deep generative model of speech spectrograms | |
Abdipour et al. | Binaural source separation based on spatial cues and maximum likelihood model adaptation | |
WO2019194300A1 (en) | Signal analysis device, signal analysis method, and signal analysis program | |
Subba Ramaiah et al. | A novel approach for speaker diarization system using TMFCC parameterization and Lion optimization | |
Jafari et al. | On the use of the Watson mixture model for clustering-based under-determined blind source separation. | |
Abbas et al. | Enhancing Linear Independent Component Analysis: Comparison of Various Metaheuristic Methods. | |
Samui et al. | Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines. | |
Zohny et al. | Modelling interaural level and phase cues with Student's t-distribution for robust clustering in MESSL | |
Mirzaei et al. | Blind audio source separation of stereo mixtures using bayesian non-negative matrix factorization | |
Chakraborty et al. | Detection and positioning of overlapped sounds in a room environment |