
de Veth et al., 1999 - Google Patents

Acoustic pre-processing for optimal effectivity of missing feature theory


Document ID
1188383352583561087
Author
de Veth J
Cranen B
de Wet F
Boves L
Publication year
1999


Snippet

In this paper we investigate acoustic backing-off as an operationalization of Missing Feature Theory with the aim to increase recognition robustness. Acoustic backing-off effectively diminishes the detrimental influence of outlier values by using a new model of the …
Continue reading at repository.ubn.ru.nl (PDF) (other versions)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]
    • G10L15/144 Training of HMMs
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G10L15/07 Adaptation to the speaker
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
    • G10L25/09 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
    • G10L25/30 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication Publication Date Title
EP0792503B1 (en) Signal conditioned minimum error rate training for continuous speech recognition
Zhao An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition
EP0886263B1 (en) Environmentally compensated speech processing
Reynolds et al. Speaker verification using adapted Gaussian mixture models
Acero et al. Robust speech recognition by normalization of the acoustic space.
CA2233728C (en) Multiple models integration for multi-environment speech recognition
US8615393B2 (en) Noise suppressor for speech recognition
JPH0850499A (en) Signal identification method
WO1997010587A9 (en) Signal conditioned minimum error rate training for continuous speech recognition
Mokbel et al. Towards improving ASR robustness for PSN and GSM telephone applications
Hsu et al. Higher order cepstral moment normalization (HOCMN) for robust speech recognition
Liao et al. Joint uncertainty decoding for robust large vocabulary speech recognition
Haton Automatic speech recognition: A Review
de Veth et al. Acoustic backing-off as an implementation of missing feature theory
Ming et al. Union: a new approach for combining sub-band observations for noisy speech recognition
de Veth et al. Missing feature theory in ASR: make sure you miss the right type of features
de Veth et al. Acoustic pre-processing for optimal effectivity of missing feature theory
Parmar et al. Comparison of performance of the features of speech signal for non-intrusive speech quality assessment
Ming et al. Speech recognition with unknown partial feature corruption–a review of the union model
Upadhyay et al. Robust recognition of English speech in noisy environments using frequency warped signal processing
de Veth et al. Acoustic features and a distance measure that reduce the impact of training–test mismatch in ASR
Zhao Control system and speech recognition of exhibition hall digital media based on computer technology
Vali et al. Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network.
Ebrahim Kafoori et al. Robust recognition of noisy speech through partial imputation of missing data
Surendran et al. Predictive adaptation and compensation for robust speech recognition