Demuynck et al., 2000 - Google Patents

An efficient search space representation for large vocabulary continuous speech recognition

Document ID
3508292832252745033
Author
Demuynck K
Duchateau J
Van Compernolle D
Wambacq P
Publication year
2000
Publication venue
Speech communication

Snippet

In pursuance of better performance, current speech recognition systems tend to use more and more complicated models for both the acoustic and the language component. Cross-word context dependent (CD) phone models and long-span statistical language models …

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197 Probabilistic grammars, e.g. word n-grams
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/085 Methods for reducing search complexity, pruning
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems

Similar Documents

Publication Publication Date Title
Demuynck et al. An efficient search space representation for large vocabulary continuous speech recognition
US6668243B1 (en) Network and language models for use in a speech recognition system
Mohri et al. Speech recognition with weighted finite-state transducers
Mohri et al. Weighted finite-state transducers in speech recognition
Riccardi et al. Stochastic automata for language modeling
Aubert An overview of decoding techniques for large vocabulary continuous speech recognition
US6574597B1 (en) Fully expanded context-dependent networks for speech recognition
Hori et al. Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
US7711561B2 (en) Speech recognition system and technique
Mohri et al. Network optimizations for large-vocabulary speech recognition
GB2453366A (en) Automatic speech recognition method and apparatus
Renals et al. Efficient search using posterior phone probability estimates
Mohri et al. Weighted determinization and minimization for large vocabulary speech recognition.
Mohri et al. Integrated context-dependent networks in very large vocabulary speech recognition.
Riley et al. Transducer composition for context-dependent network expansion.
KR100726875B1 (en) Speech recognition device with complementary language model for typical mistakes in verbal conversation
US6980954B1 (en) Search method based on single triphone tree for large vocabulary continuous speech recognizer
Brugnara et al. Dynamic language models for interactive speech applications.
Ishikawa et al. Parallel LVCSR algorithm for cellphone-oriented multicore processors
US20030061046A1 (en) Method and system for integrating long-span language model into speech recognition system
Demuynck et al. A static lexicon network representation for cross-word context dependent phones.
Caseiro et al. Transducer composition for "on-the-fly" lexicon and language model integration
Novak Towards large vocabulary ASR on embedded platforms.
Zitouni et al. Statistical language modeling based on variable-length sequences
US20050288928A1 (en) Memory efficient decoding graph compilation system and method