McGraw et al., 2010 - Google Patents

Collecting Voices from the Cloud.

Document ID: 16659074675733661815
Author: McGraw I; Lee C; Hetherington I; Seneff S; Glass J
Publication year: 2010
Publication venue: LREC

Snippet

The collection and transcription of speech data is typically an expensive and time-consuming task. Voice over IP and cloud computing are poised to eliminate this impediment to research on spoken language interfaces in many domains. This paper documents our …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G06Q10/105—Human resources
- G06Q10/1053—Employment or hiring
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G06Q10/109—Time management, e.g. calendars, reminders, meetings, time accounting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions

Similar Documents

Publication	Publication Date	Title
McGraw et al.	2010	Collecting Voices from the Cloud.
Glass	1999	Challenges for spoken dialogue systems
Asri et al.	2017	Frames: a corpus for adding memory to goal-oriented dialogue systems
Walker et al.	2002	DARPA communicator: cross-system results for the 2001 evaluation.
Walker et al.	2002	DARPA communicator evaluation: progress from 2000 to 2001.
McGraw	2012	Crowd-supervised training of spoken language systems
Zue et al.	2008	Spoken dialogue systems
Bohus et al.	2007	Conquest—an open-source dialog system for conferences
Nisimura et al.	2003	Takemaru-kun: Speech-oriented information system for real world research platform
McGraw et al.	2013	How to Control and Utilize Crowd‐Collected Speech
Ma et al.	2024	Introducing bed word: A new automated speech recognition tool for sociolinguistic interview transcription
McGraw et al.	2009	A self-labeling speech corpus: collecting spoken words with an online educational game.
Lai et al.	2009	Conversational speech interfaces and technologies
Zue	1994	Human computer interactions using language based technology
Gorisch et al.	2020	Using automatic speech recognition in spoken corpus curation
Lamel	1998	Spoken language dialog system development and evaluation at LIMSI
Arora et al.	2016	Collaborative speech data acquisition for under resourced languages through crowdsourcing
Ynoguti et al.	2008	A Brazilian Portuguese speech database
Wirén et al.	2007	Experiences of an in-service Wizard-of-Oz data collection for the deployment of a call-routing application
Williams	2009	Spoken dialogue systems: Challenges, and opportunities for research.
San Segundo et al.	2001	Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information system.
Hurley et al.	1996	Telephone data collection using the World Wide Web
Qasim et al.	2016	Urdu speech corpus for travel domain
Tawo	2019	Assessing the impact of (Nigerian, Ghanaian, Zimbabwean, Rwandan and Kenyan) accents on English-speaking speech recognition systems
De Wet et al.	2006	The design, collection and annotation of speech databases in South Africa