Dinkar et al., 2021 - Google Patents

From local hesitations to global impressions of a speaker's feeling of knowing

Dinkar et al., 2021

Document ID: 13048082839814338890
Author: Dinkar T; Biancardi B; Clavel C
Publication year: 2021
Publication venue: Proceedings of the 4th International Conference on Natural Language and Speech Processing (ICNLSP 2021)

External Links

Cited by

Snippet

The listener's interpretation of a speaker's utterance includes estimates about the speaker's commitment to what they are saying. Previous works have shown that fillers (eg “um”) are linked to both the speaker's metacognitive state, and the listener's impression of a speaker's …

Continue reading at aclanthology.org (PDF) (other versions)

239000000945 filler 0 abstract description 155

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information

Similar Documents

Publication	Publication Date	Title
US10334384B2 (en)	2019-06-25	Scheduling playback of audio in a virtual acoustic space
US8447604B1 (en)	2013-05-21	Method and apparatus for processing scripts and related data
KR101990023B1 (en)	2019-06-18	Method for chunk-unit separation rule and display automated key word to develop foreign language studying, and system thereof
US11076052B2 (en)	2021-07-27	Selective conference digest
EP3254279B1 (en)	2018-11-21	Conference word cloud
US20180027351A1 (en)	2018-01-25	Optimized virtual scene layout for spatial meeting playback
US9311914B2 (en)	2016-04-12	Method and apparatus for enhanced phonetic indexing and search
CN112233680B (en)	2024-02-13	Speaker character recognition method, speaker character recognition device, electronic equipment and storage medium
Moore	2015	Automated transcription and conversation analysis
US20210232776A1 (en)	2021-07-29	Method for recording and outputting conversion between multiple parties using speech recognition technology, and device therefor
Kopparapu	2015	Non-linguistic analysis of call center conversations
CN115485768A (en)	2022-12-16	End-to-end multi-speaker overlapping speech recognition
CN114328867A (en)	2022-04-12	Intelligent interruption method and device in man-machine conversation
Soe et al.	2021	Evaluating AI assisted subtitling
CN111798871A (en)	2020-10-20	Session link identification method, device and equipment and storage medium
US20100076747A1 (en)	2010-03-25	Mass electronic question filtering and enhancement system for audio broadcasts and voice conferences
Bechet et al.	2014	Adapting dependency parsing to spontaneous speech for open domain spoken language understanding
CN107886940B (en)	2021-10-08	Voice translation processing method and device
Dinkar et al.	2021	From local hesitations to global impressions of a speaker’s feeling of knowing
Yamasaki et al.	2023	Transcribing And Aligning Conversational Speech: A Hybrid Pipeline Applied To French Conversations
Dinkar et al.	2021	From local hesitations to global impressions of the listener
US10657202B2 (en)	2020-05-19	Cognitive presentation system and method
US11632345B1 (en)	2023-04-18	Message management for communal account
Kwak et al.	2024	VoxMM: Rich Transcription of Conversations in the Wild
Mirzaei et al.	2021	Adaptive Listening Difficulty Detection for L2 Learners Through Moderating ASR Resources.