McGraw et al., 2010 - Google Patents
Collecting Voices from the Cloud.McGraw et al., 2010
View PDF- Document ID
- 16659074675733661815
- Author
- McGraw I
- Lee C
- Hetherington I
- Seneff S
- Glass J
- Publication year
- Publication venue
- LREC
External Links
Snippet
The collection and transcription of speech data is typically an expensive and time- consuming task. Voice over IP and cloud computing are poised to eliminate this impediment to research on spoken language interfaces in many domains. This paper documents our …
- 230000035897 transcription 0 abstract description 18
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G06Q10/105—Human resources
- G06Q10/1053—Employment or hiring
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G06Q10/109—Time management, e.g. calendars, reminders, meetings, time accounting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
Similar Documents
Publication | Publication Date | Title |
---|---|---|
McGraw et al. | Collecting Voices from the Cloud. | |
Glass | Challenges for spoken dialogue systems | |
Asri et al. | Frames: a corpus for adding memory to goal-oriented dialogue systems | |
Walker et al. | DARPA communicator: cross-system results for the 2001 evaluation. | |
Walker et al. | DARPA communicator evaluation: progress from 2000 to 2001. | |
McGraw | Crowd-supervised training of spoken language systems | |
Zue et al. | Spoken dialogue systems | |
Bohus et al. | Conquest—an open-source dialog system for conferences | |
Nisimura et al. | Takemaru-kun: Speech-oriented information system for real world research platform | |
McGraw et al. | How to Control and Utilize Crowd‐Collected Speech | |
Ma et al. | Introducing bed word: A new automated speech recognition tool for sociolinguistic interview transcription | |
McGraw et al. | A self-labeling speech corpus: collecting spoken words with an online educational game. | |
Lai et al. | Conversational speech interfaces and technologies | |
Zue | Human computer interactions using language based technology | |
Gorisch et al. | Using automatic speech recognition in spoken corpus curation | |
Lamel | Spoken language dialog system development and evaluation at LIMSI | |
Arora et al. | Collaborative speech data acquisition for under resourced languages through crowdsourcing | |
Ynoguti et al. | A Brazilian Portuguese speech database | |
Wirén et al. | Experiences of an in-service Wizard-of-Oz data collection for the deployment of a call-routing application | |
Williams | Spoken dialogue systems: Challenges, and opportunities for research. | |
San Segundo et al. | Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information system. | |
Hurley et al. | Telephone data collection using the World Wide Web | |
Qasim et al. | Urdu speech corpus for travel domain | |
Tawo | Assessing the impact of (Nigerian, Ghanaian, Zimbabwean, Rwandan and Kenyan) accents on English-speaking speech recognition systems | |
De Wet et al. | The design, collection and annotation of speech databases in South Africa |