Nothing Special   »   [go: up one dir, main page]

DE602004023364D1 - Vorrichtung und Verfahren zur Spracherkennung - Google Patents

Vorrichtung und Verfahren zur Spracherkennung

Info

Publication number
DE602004023364D1
DE602004023364D1 DE602004023364T DE602004023364T DE602004023364D1 DE 602004023364 D1 DE602004023364 D1 DE 602004023364D1 DE 602004023364 T DE602004023364 T DE 602004023364T DE 602004023364 T DE602004023364 T DE 602004023364T DE 602004023364 D1 DE602004023364 D1 DE 602004023364D1
Authority
DE
Germany
Prior art keywords
speech recognition
speech
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602004023364T
Other languages
English (en)
Inventor
Toshiaki Fukada
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of DE602004023364D1 publication Critical patent/DE602004023364D1/de
Anticipated expiration legal-status Critical
Active legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DE602004023364T 2003-12-12 2004-12-10 Vorrichtung und Verfahren zur Spracherkennung Active DE602004023364D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2003415425A JP4040573B2 (ja) 2003-12-12 2003-12-12 音声認識装置および方法

Publications (1)

Publication Number Publication Date
DE602004023364D1 true DE602004023364D1 (de) 2009-11-12

Family

ID=34510574

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602004023364T Active DE602004023364D1 (de) 2003-12-12 2004-12-10 Vorrichtung und Verfahren zur Spracherkennung

Country Status (4)

Country Link
US (1) US7624011B2 (de)
EP (1) EP1542207B1 (de)
JP (1) JP4040573B2 (de)
DE (1) DE602004023364D1 (de)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040073690A1 (en) 2002-09-30 2004-04-15 Neil Hepworth Voice over IP endpoint call admission
US7359979B2 (en) 2002-09-30 2008-04-15 Avaya Technology Corp. Packet prioritization and associated bandwidth and buffer management techniques for audio over IP
US8103510B2 (en) * 2003-12-26 2012-01-24 Kabushikikaisha Kenwood Device control device, speech recognition device, agent device, on-vehicle device control device, navigation device, audio device, device control method, speech recognition method, agent processing method, on-vehicle device control method, navigation method, and audio device control method, and program
US7978827B1 (en) 2004-06-30 2011-07-12 Avaya Inc. Automatic configuration of call handling based on end-user needs and characteristics
US20060009974A1 (en) * 2004-07-09 2006-01-12 Matsushita Electric Industrial Co., Ltd. Hands-free voice dialing for portable and remote devices
US20120253823A1 (en) * 2004-09-10 2012-10-04 Thomas Barton Schalk Hybrid Dialog Speech Recognition for In-Vehicle Automated Interaction and In-Vehicle Interfaces Requiring Minimal Driver Processing
US8244545B2 (en) * 2006-03-30 2012-08-14 Microsoft Corporation Dialog repair based on discrepancies between user model predictions and speech recognition results
JP4188989B2 (ja) * 2006-09-15 2008-12-03 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識プログラム
US20080091426A1 (en) * 2006-10-12 2008-04-17 Rod Rempel Adaptive context for automatic speech recognition systems
CN101377924A (zh) * 2007-08-31 2009-03-04 鹏智科技(深圳)有限公司 可会话的类生物装置及其会话方法
US8401780B2 (en) * 2008-01-17 2013-03-19 Navteq B.V. Method of prioritizing similar names of locations for use by a navigation system
DE102008028090A1 (de) * 2008-02-29 2009-09-10 Navigon Ag Verfahren zum Betrieb eines Navigationssystems
US8218751B2 (en) 2008-09-29 2012-07-10 Avaya Inc. Method and apparatus for identifying and eliminating the source of background noise in multi-party teleconferences
WO2010061751A1 (ja) * 2008-11-25 2010-06-03 旭化成株式会社 重み係数生成装置、音声認識装置、ナビゲーション装置、車両、重み係数生成方法、及び重み係数生成プログラム
DE102010040553A1 (de) * 2010-09-10 2012-03-15 Siemens Aktiengesellschaft Spracherkennungsverfahren
JP5370335B2 (ja) * 2010-10-26 2013-12-18 日本電気株式会社 音声認識支援システム、音声認識支援装置、利用者端末、方法およびプログラム
JP5799733B2 (ja) * 2011-10-12 2015-10-28 富士通株式会社 認識装置、認識プログラムおよび認識方法
US9190057B2 (en) 2012-12-12 2015-11-17 Amazon Technologies, Inc. Speech model retrieval in distributed speech recognition systems
US9097548B2 (en) * 2013-01-07 2015-08-04 Televav, Inc. Content delivery system with natural language mechanism and method of operation thereof
JP6413263B2 (ja) * 2014-03-06 2018-10-31 株式会社デンソー 報知装置
US10089765B2 (en) * 2014-10-20 2018-10-02 Bernardo Jose Martinez-Avalos Methods and computer programs to create images and information based in texts
JP6443843B2 (ja) * 2015-09-17 2018-12-26 日本電信電話株式会社 言語モデル作成装置、言語モデル作成方法、およびプログラム
EP3392740A4 (de) * 2015-12-18 2018-12-19 Sony Corporation Informationsverarbeitungsvorrichtung, informationsverarbeitungsverfahren und programm
CN105975099B (zh) * 2016-04-28 2020-02-04 百度在线网络技术(北京)有限公司 输入法的实现方法和装置
US20180196798A1 (en) * 2017-01-06 2018-07-12 Wipro Limited Systems and methods for creating concept maps using concept gravity matrix
JP6987447B2 (ja) * 2017-11-03 2022-01-05 アルパイン株式会社 音声認識装置
KR20190113693A (ko) * 2019-09-18 2019-10-08 엘지전자 주식회사 단어 사용 빈도를 고려하여 사용자의 음성을 인식하는 인공 지능 장치 및 그 방법
CN114491279A (zh) * 2022-02-22 2022-05-13 车主邦(北京)科技有限公司 油站的选址方法、装置及电子设备

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2091658A1 (en) * 1993-03-15 1994-09-16 Matthew Lennig Method and apparatus for automation of directory assistance using speech recognition
US5524169A (en) * 1993-12-30 1996-06-04 International Business Machines Incorporated Method and system for location-specific speech recognition
JP2907728B2 (ja) 1994-08-10 1999-06-21 富士通テン株式会社 音声処理装置
JPH10143191A (ja) * 1996-11-13 1998-05-29 Hitachi Ltd 音声認識システム
US5995894A (en) * 1997-05-27 1999-11-30 Case Corporation System for analyzing spatially-variable harvest data by pass
US6122361A (en) * 1997-09-12 2000-09-19 Nortel Networks Corporation Automated directory assistance system utilizing priori advisor for predicting the most likely requested locality
US6483896B1 (en) * 1998-02-05 2002-11-19 At&T Corp. Speech recognition using telephone call parameters
JP3500948B2 (ja) 1998-02-18 2004-02-23 株式会社デンソー 音声認識装置
JP3990075B2 (ja) * 1999-06-30 2007-10-10 株式会社東芝 音声認識支援方法及び音声認識システム
JP2001328451A (ja) * 2000-05-18 2001-11-27 Denso Corp 進行路推定装置、先行車認識装置、及び記録媒体
US6907436B2 (en) * 2000-10-27 2005-06-14 Arizona Board Of Regents, Acting For And On Behalf Of Arizona State University Method for classifying data using clustering and classification algorithm supervised
US20020072917A1 (en) * 2000-12-11 2002-06-13 Irvin David Rand Method and apparatus for speech recognition incorporating location information
US20020111810A1 (en) * 2001-02-15 2002-08-15 Khan M. Salahuddin Spatially built word list for automatic speech recognition program and method for formation thereof
US7184957B2 (en) * 2002-09-25 2007-02-27 Toyota Infotechnology Center Co., Ltd. Multiple pass speech recognition method and system
US20040193603A1 (en) * 2003-03-28 2004-09-30 Ljubicich Philip A. Technique for effectively searching for information in response to requests in information assistance service

Also Published As

Publication number Publication date
JP4040573B2 (ja) 2008-01-30
US7624011B2 (en) 2009-11-24
EP1542207B1 (de) 2009-09-30
EP1542207A1 (de) 2005-06-15
US20050131699A1 (en) 2005-06-16
JP2005173390A (ja) 2005-06-30

Similar Documents

Publication Publication Date Title
DE60309822D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE602004023364D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE60317025D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE60234530D1 (de) Vorrichtung und verfahren zur spracherkennung
DE60207863D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE602004014675D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE60213490D1 (de) Gerät und Verfahren zur Fingerabdruckerkennung
DE60237007D1 (de) Verfahren und vorrichtung zur kurzfristigen inspekrobustheit
DE60310785D1 (de) Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache
DE60218252D1 (de) Verfahren und Vorrichtung zur Sprachtranskodierung
DE602004019713D1 (de) Vorrichtung und verfahren zur synchronisierten antitachykarden stimulation
DE60217597D1 (de) Gerät und Verfahren zur Personenerkennung
DE50203544D1 (de) Verfahren und Vorrichtung zur Drehbearbeitung
DE60124559D1 (de) Einrichtung und verfahren zur spracherkennung
DE60316912D1 (de) Verfahren zur Spracherkennung
DE602006000487D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE10359431A8 (de) Verfahren und Vorrichtung zur vaskulären Navigation
DE602004018278D1 (de) Vorrichtung und verfahren zur schnellen detektion
DE60229315D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE602004026813D1 (de) Verfahren und Vorrichtung zur Bürstenherstellung
DE50109323D1 (de) Verfahren und vorrichtung zur spracherkennung
DE502004002300D1 (de) Verfahren zur sprecherabhängigen spracherkennung und spracherkennungssystem
DE60216907D1 (de) Vorrichtung und Verfahren zur Wellenlängenbestimmung
DE602005008201D1 (de) Vorrichtung und verfahren zur druckreduzierung
DE60205421D1 (de) Verfahren und Vorrichtung zur Sprachsynthese

Legal Events

Date Code Title Description
8364 No opposition during term of opposition