Demuynck et al., 2000 - Google Patents

An efficient search space representation for large vocabulary continuous speech recognition

Demuynck et al., 2000

Document ID: 3508292832252745033
Author: Demuynck K; Duchateau J; Van Compernolle D; Wambacq P
Publication year: 2000
Publication venue: Speech communication

External Links

Cited by

Snippet

In pursuance of better performance, current speech recognition systems tend to use more and more complicated models for both the acoustic and the language component. Cross- word context dependent (CD) phone models and long-span statistical language models …

Continue reading at www.academia.edu (PDF) (other versions)

230000001419 dependent 0 abstract description 15

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/085—Methods for reducing search complexity, pruning
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems

Similar Documents

Publication	Publication Date	Title
Demuynck et al.	2000	An efficient search space representation for large vocabulary continuous speech recognition
US6668243B1 (en)	2003-12-23	Network and language models for use in a speech recognition system
Mohri et al.	2008	Speech recognition with weighted finite-state transducers
Mohri et al.	2002	Weighted finite-state transducers in speech recognition
Riccardi et al.	1996	Stochastic automata for language modeling
Aubert	2002	An overview of decoding techniques for large vocabulary continuous speech recognition
US6574597B1 (en)	2003-06-03	Fully expanded context-dependent networks for speech recognition
Hori et al.	2007	Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
US7711561B2 (en)	2010-05-04	Speech recognition system and technique
Mohri et al.	1999	Network optimizations for large-vocabulary speech recognition
GB2453366A (en)	2009-04-08	Automatic speech recognition method and apparatus
Renals et al.	1995	Efficient search using posterior phone probability estimates
Mohri et al.	1997	Weighted determinization and minimization for large vocabulary speech recognition.
Mohri et al.	1999	Integrated context-dependent networks in very large vocabulary speech recognition.
Riley et al.	1997	Transducer composition for context-dependent network expansion.
KR100726875B1 (en)	2007-06-14	Speech recognition device with complementary language model for typical mistakes in verbal conversation
US6980954B1 (en)	2005-12-27	Search method based on single triphone tree for large vocabulary continuous speech recognizer
Brugnara et al.	1997	Dynamic language models for interactive speech applications.
Ishikawa et al.	2006	Parallel LVCSR algorithm for cellphone-oriented multicore processors
US20030061046A1 (en)	2003-03-27	Method and system for integrating long-span language model into speech recognition system
Demuynck et al.	1997	A static lexicon network representation for cross-word context dependent phones.
Caseiro et al.	2001	Transducer composition for" on-the-fly" lexicon and language model integration
Novak	2004	Towards large vocabulary ASR on embedded platforms.
Zitouni et al.	2003	Statistical language modeling based on variable-length sequences
US20050288928A1 (en)	2005-12-29	Memory efficient decoding graph compilation system and method