Sundar et al., 2012 - Google Patents

Identification of active sources in single-channel convolutive mixtures using known source models

Document ID
14970855798672495845
Author
Sundar H
Sreenivas T
Kellermann W
Publication year
2012
Publication venue
IEEE Signal Processing Letters

Snippet

We address the problem of identifying the constituent sources in a single-sensor mixture signal consisting of contributions from multiple simultaneously active sources. We propose a generic framework for mixture signal analysis based on a latent variable approach. The …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • G10L21/028 Voice signal separating using properties of sound source
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/66 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06K9/6261 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation partitioning the feature space
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass

Similar Documents

Publication Publication Date Title
Bando et al. Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization
Luo et al. Speaker-independent speech separation with deep attractor network
Gao et al. Unsupervised single-channel separation of nonstationary signals using gammatone filterbank and Itakura–Saito nonnegative matrix two-dimensional factorizations
Bando et al. Neural full-rank spatial covariance analysis for blind source separation
Richter et al. Speech Enhancement with Stochastic Temporal Convolutional Networks
Adiloğlu et al. Variational Bayesian inference for source separation and robust feature extraction
JP5994639B2 (en) Sound section detection device, sound section detection method, and sound section detection program
Duong et al. Gaussian modeling-based multichannel audio source separation exploiting generic source spectral model
Bahari et al. Distributed multi-speaker voice activity detection for wireless acoustic sensor networks
Shashanka et al. Sparse overcomplete decomposition for single channel speaker separation
Sundar et al. Identification of active sources in single-channel convolutive mixtures using known source models
Doulaty et al. Automatic optimization of data perturbation distributions for multi-style training in speech recognition
WO2012105385A1 (en) Sound segment classification device, sound segment classification method, and sound segment classification program
Schmidt et al. Linear regression on sparse features for single-channel speech separation
JP5726790B2 (en) Sound source separation device, sound source separation method, and program
Du et al. Semi-supervised multichannel speech separation based on a phone- and speaker-aware deep generative model of speech spectrograms
Abdipour et al. Binaural source separation based on spatial cues and maximum likelihood model adaptation
WO2019194300A1 (en) Signal analysis device, signal analysis method, and signal analysis program
Subba Ramaiah et al. A novel approach for speaker diarization system using TMFCC parameterization and Lion optimization
Jafari et al. On the use of the Watson mixture model for clustering-based under-determined blind source separation
Abbas et al. Enhancing Linear Independent Component Analysis: Comparison of Various Metaheuristic Methods
Samui et al. Deep Recurrent Neural Network Based Monaural Speech Separation Using Recurrent Temporal Restricted Boltzmann Machines
Zohny et al. Modelling interaural level and phase cues with Student's t-distribution for robust clustering in MESSL
Mirzaei et al. Blind audio source separation of stereo mixtures using Bayesian non-negative matrix factorization
Chakraborty et al. Detection and positioning of overlapped sounds in a room environment