Nothing Special   »   [go: up one dir, main page]

Yu et al., 2010 - Google Patents

Automatic beamforming for blind extraction of speech from music environment using variance of spectral flux-inspired criterion

Yu et al., 2010

View PDF
Document ID
7064054465510776848
Author
Yu T
Hansen J
Publication year
Publication venue
IEEE Journal of Selected Topics in Signal Processing

External Links

Snippet

This paper addresses the problem of automatic beamforming for blind extraction of speech in a music environment, using multiple microphones. A new criterion is proposed based on the variance of the spectral flux (VSF), which is shown to be a compound measure of the …
Continue reading at www.utdallas.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/028Voice signal separating using properties of sound source
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/06Decision making techniques; Pattern matching strategies
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers

Similar Documents

Publication Publication Date Title
CN109830245B (en) Multi-speaker voice separation method and system based on beam forming
Saruwatari et al. Blind source separation combining independent component analysis and beamforming
Vu et al. Blind speech separation employing directional statistics in an expectation maximization framework
Souden et al. A multichannel MMSE-based framework for speech source separation and noise reduction
McCowan et al. Robust speaker recognition using microphone arrays
Kolossa et al. Nonlinear postprocessing for blind speech separation
Ito et al. Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra
Kumatani et al. Beamforming with a maximum negentropy criterion
Knaak et al. Geometrically constrained independent component analysis
Koldovský et al. Semi-blind noise extraction using partially known position of the target source
Xiao et al. Beamforming networks using spatial covariance features for far-field speech recognition
Li et al. Online Directional Speech Enhancement Using Geometrically Constrained Independent Vector Analysis.
Hoang et al. Joint maximum likelihood estimation of power spectral densities and relative acoustic transfer functions for acoustic beamforming
Kovalyov et al. Dsenet: Directional signal extraction network for hearing improvement on edge devices
Yu et al. Automatic beamforming for blind extraction of speech from music environment using variance of spectral flux-inspired criterion
Maazaoui et al. Adaptive blind source separation with HRTFs beamforming preprocessing
Kim et al. Probabilistic spectral gain modification applied to beamformer-based noise reduction in a car environment
Martín-Doñas et al. Multi-channel block-online source extraction based on utterance adaptation
Yamaoka et al. Performance evaluation of nonlinear speech enhancement based on virtual increase of channels in reverberant environments
McCowan et al. Multi-channel sub-band speech recognition
Rafique et al. Mixed source prior for the fast independent vector analysis algorithm
Zamani et al. Convolutive blind source separation with independent vector analysis and beamforming
Goto et al. Study on geometrically constrained IVA with auxiliary function approach and VCD for in-car communication
Chen et al. Reference microphone selection and low-rank approximation based multichannel wiener filter with application to speech recognition
Mimura et al. Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.