Demuynck et al., 2000 - Google Patents
An efficient search space representation for large vocabulary continuous speech recognitionDemuynck et al., 2000
View PDF- Document ID
- 3508292832252745033
- Author
- Demuynck K
- Duchateau J
- Van Compernolle D
- Wambacq P
- Publication year
- Publication venue
- Speech communication
External Links
Snippet
In pursuance of better performance, current speech recognition systems tend to use more and more complicated models for both the acoustic and the language component. Cross- word context dependent (CD) phone models and long-span statistical language models …
- 230000001419 dependent 0 abstract description 15
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Demuynck et al. | An efficient search space representation for large vocabulary continuous speech recognition | |
US6668243B1 (en) | Network and language models for use in a speech recognition system | |
Mohri et al. | Speech recognition with weighted finite-state transducers | |
Mohri et al. | Weighted finite-state transducers in speech recognition | |
Riccardi et al. | Stochastic automata for language modeling | |
Aubert | An overview of decoding techniques for large vocabulary continuous speech recognition | |
US6574597B1 (en) | Fully expanded context-dependent networks for speech recognition | |
Hori et al. | Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition | |
US7711561B2 (en) | Speech recognition system and technique | |
Mohri et al. | Network optimizations for large-vocabulary speech recognition | |
GB2453366A (en) | Automatic speech recognition method and apparatus | |
Renals et al. | Efficient search using posterior phone probability estimates | |
Mohri et al. | Weighted determinization and minimization for large vocabulary speech recognition. | |
Mohri et al. | Integrated context-dependent networks in very large vocabulary speech recognition. | |
Riley et al. | Transducer composition for context-dependent network expansion. | |
KR100726875B1 (en) | Speech recognition device with complementary language model for typical mistakes in verbal conversation | |
US6980954B1 (en) | Search method based on single triphone tree for large vocabulary continuous speech recognizer | |
Brugnara et al. | Dynamic language models for interactive speech applications. | |
Ishikawa et al. | Parallel LVCSR algorithm for cellphone-oriented multicore processors | |
US20030061046A1 (en) | Method and system for integrating long-span language model into speech recognition system | |
Demuynck et al. | A static lexicon network representation for cross-word context dependent phones. | |
Caseiro et al. | Transducer composition for" on-the-fly" lexicon and language model integration | |
Novak | Towards large vocabulary ASR on embedded platforms. | |
Zitouni et al. | Statistical language modeling based on variable-length sequences | |
US20050288928A1 (en) | Memory efficient decoding graph compilation system and method |