Nothing Special   »   [go: up one dir, main page]

DE69916297D1 - Zwischen-wörter verbindung phonemische modelle - Google Patents

Zwischen-wörter verbindung phonemische modelle

Info

Publication number
DE69916297D1
DE69916297D1 DE69916297T DE69916297T DE69916297D1 DE 69916297 D1 DE69916297 D1 DE 69916297D1 DE 69916297 T DE69916297 T DE 69916297T DE 69916297 T DE69916297 T DE 69916297T DE 69916297 D1 DE69916297 D1 DE 69916297D1
Authority
DE
Germany
Prior art keywords
word
phone
models
vocabulary
input utterance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69916297T
Other languages
English (en)
Inventor
Vladimir Sejnoha
Tom Lynch
Ramesh Sarukkai
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lernout and Hauspie Speech Products NV
Original Assignee
Lernout and Hauspie Speech Products NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lernout and Hauspie Speech Products NV filed Critical Lernout and Hauspie Speech Products NV
Application granted granted Critical
Publication of DE69916297D1 publication Critical patent/DE69916297D1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • G10L15/05Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/022Demisyllables, biphones or triphones being the recognition units

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Navigation (AREA)
  • Telephone Set Structure (AREA)
  • Telephone Function (AREA)
DE69916297T 1998-09-29 1999-09-29 Zwischen-wörter verbindung phonemische modelle Expired - Lifetime DE69916297D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10237398P 1998-09-29 1998-09-29
PCT/US1999/022501 WO2000019409A1 (en) 1998-09-29 1999-09-29 Inter-word triphone models

Publications (1)

Publication Number Publication Date
DE69916297D1 true DE69916297D1 (de) 2004-05-13

Family

ID=22289500

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69916297T Expired - Lifetime DE69916297D1 (de) 1998-09-29 1999-09-29 Zwischen-wörter verbindung phonemische modelle

Country Status (7)

Country Link
US (1) US6606594B1 (de)
EP (1) EP1116218B1 (de)
AT (1) ATE263997T1 (de)
AU (1) AU6501999A (de)
CA (1) CA2395012A1 (de)
DE (1) DE69916297D1 (de)
WO (1) WO2000019409A1 (de)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19939102C1 (de) * 1999-08-18 2000-10-26 Siemens Ag Verfahren und Anordnung zum Erkennen von Sprache
DE10120513C1 (de) * 2001-04-26 2003-01-09 Siemens Ag Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache
JP2003208195A (ja) * 2002-01-16 2003-07-25 Sharp Corp 連続音声認識装置および連続音声認識方法、連続音声認識プログラム、並びに、プログラム記録媒体
TWI454955B (zh) * 2006-12-29 2014-10-01 Nuance Communications Inc 使用模型檔產生動畫的方法及電腦可讀取的訊號承載媒體
KR100897554B1 (ko) * 2007-02-21 2009-05-15 삼성전자주식회사 분산 음성인식시스템 및 방법과 분산 음성인식을 위한 단말기
US8536976B2 (en) * 2008-06-11 2013-09-17 Veritrix, Inc. Single-channel multi-factor authentication
US8185646B2 (en) * 2008-11-03 2012-05-22 Veritrix, Inc. User authentication for social networks
US8166297B2 (en) 2008-07-02 2012-04-24 Veritrix, Inc. Systems and methods for controlling access to encrypted data stored on a mobile device
US8914279B1 (en) * 2011-09-23 2014-12-16 Google Inc. Efficient parsing with structured prediction cascades
US9602666B2 (en) 2015-04-09 2017-03-21 Avaya Inc. Silence density models
US10134425B1 (en) * 2015-06-29 2018-11-20 Amazon Technologies, Inc. Direction-based speech endpointing
US10121471B2 (en) * 2015-06-29 2018-11-06 Amazon Technologies, Inc. Language model speech endpointing
US11615239B2 (en) * 2020-03-31 2023-03-28 Adobe Inc. Accuracy of natural language input classification utilizing response delay

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS57178295A (en) * 1981-04-27 1982-11-02 Nippon Electric Co Continuous word recognition apparatus
US5268990A (en) 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5502790A (en) * 1991-12-24 1996-03-26 Oki Electric Industry Co., Ltd. Speech recognition method and system using triphones, diphones, and phonemes
JPH0728487A (ja) 1993-03-26 1995-01-31 Texas Instr Inc <Ti> 音声認識方法
US5819221A (en) * 1994-08-31 1998-10-06 Texas Instruments Incorporated Speech recognition using clustered between word and/or phrase coarticulation
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US6163769A (en) * 1997-10-02 2000-12-19 Microsoft Corporation Text-to-speech using clustered context-dependent phoneme-based units

Also Published As

Publication number Publication date
WO2000019409A1 (en) 2000-04-06
EP1116218B1 (de) 2004-04-07
US6606594B1 (en) 2003-08-12
EP1116218A1 (de) 2001-07-18
WO2000019409A9 (en) 2000-08-31
CA2395012A1 (en) 2000-04-06
ATE263997T1 (de) 2004-04-15
AU6501999A (en) 2000-04-17

Similar Documents

Publication Publication Date Title
CA2315832A1 (en) System for using silence in speech recognition
CA2275774A1 (en) Selection of superwords based on criteria relevant to both speech recognition and understanding
MX9703138A (es) Reconocimiento de lenguaje.
WO2007117814A3 (en) Voice signal perturbation for speech recognition
DE69916297D1 (de) Zwischen-wörter verbindung phonemische modelle
WO2006062707A3 (en) System and method for speech recognition-enabled automated call routing
WO2002054033A3 (en) Hierarchical language models for speech recognition
EP1205908A3 (de) Aussprache von neuen Wörtern zur Sprachverarbeitung
ATE395685T1 (de) Spracherkennung durch wort-in-phrase-befehl
DE69827667D1 (de) Vokoder basierter spracherkenner
EP1629464A4 (de) Spracherkennungssystem und verfahren auf phonetischer basis
AU2001250579A1 (en) Discriminatively trained mixture models in continuous speech recognition
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
ATE235733T1 (de) Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
KR19980070329A (ko) 사용자 정의 문구의 화자 독립 인식을 위한 방법 및 시스템
Nafis et al. Speech to text conversion in real-time
Boite et al. A new approach towards keyword spotting.
CN110782895A (zh) 一种基于人工智能的人机语音系统
WO2001026092A3 (en) Attribute-based word modeling
EP0916972A3 (de) Spracherkennungsverfahren und Spracherkennungsvorrichtung
WO2000046787A3 (en) System and method for automating transcription services
Neto et al. The development of a multi-purpose spoken dialogue system.
Kaur et al. Issues involved in speech to text conversion
Roe Deployment of human-machine dialogue systems.
Takahashi et al. Interactive voice technology development for telecommunications applications

Legal Events

Date Code Title Description
8332 No legal effect for de