de Veth et al., 1999 - Google Patents
Acoustic pre-processing for optimal effectivity of missing feature theoryde Veth et al., 1999
View PDF- Document ID
- 1188383352583561087
- Author
- de Veth J
- Cranen B
- Wet F
- Boves L
- Publication year
External Links
Snippet
In this paper we investigate acoustic backing-off as an opera tionalization of Missing Feature Theory with the aim to increase recognition robustness. Acoustic backing-off effectively dimin ishes the detrimental influence of outlier values by using a new model of the …
- 238000007781 pre-processing 0 title description 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0792503B1 (en) | Signal conditioned minimum error rate training for continuous speech recognition | |
Zhao | An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition | |
EP0886263B1 (en) | Environmentally compensated speech processing | |
Reynolds et al. | Speaker verification using adapted Gaussian mixture models | |
Acero et al. | Robust speech recognition by normalization of the acoustic space. | |
CA2233728C (en) | Multiple models integration for multi-environment speech recognition | |
US8615393B2 (en) | Noise suppressor for speech recognition | |
JPH0850499A (en) | Signal identification method | |
WO1997010587A9 (en) | Signal conditioned minimum error rate training for continuous speech recognition | |
Mokbel et al. | Towards improving ASR robustness for PSN and GSM telephone applications | |
Hsu et al. | Higher order cepstral moment normalization (HOCMN) for robust speech recognition | |
Liao et al. | Joint uncertainty decoding for robust large vocabulary speech recognition | |
Haton | Automatic speech recognition: A Review | |
de Veth et al. | Acoustic backing-off as an implementation of missing feature theory | |
Ming et al. | Union: a new approach for combining sub-band observations for noisy speech recognition | |
de Veth et al. | Missing feature theory in ASR: make sure you miss the right type of features | |
de Veth et al. | Acoustic pre-processing for optimal effectivity of missing feature theory | |
Parmar et al. | Comparison of performance of the features of speech signal for non-intrusive speech quality assessment | |
Ming et al. | Speech recognition with unknown partial feature corruption–a review of the union model | |
Upadhyay et al. | Robust recognition of English speech in noisy environments using frequency warped signal processing | |
de Veth et al. | Acoustic features and a distance measure that reduce the impact of training–test mismatch in ASR | |
Zhao | Control system and speech recognition of exhibition hall digital media based on computer technology | |
Vali et al. | Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network. | |
Ebrahim Kafoori et al. | Robust recognition of noisy speech through partial imputation of missing data | |
Surendran et al. | Predictive adaptation and compensation for robust speech recognition |