Xu, 2022 - Google Patents

English speech recognition and evaluation of pronunciation quality using deep learning

Document ID: 11884060325048484061
Author: Xu Y
Publication year: 2022
Publication venue: Mobile Information Systems

Snippet

English is now one of the most important languages for economic exchange around the world, and it is also the most widely used language for cultural and information exchange. Like other countries, China attaches the highest significance to …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Qian et al.	2017	Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system
Reddy et al.	2015	Toward completely automated vowel extraction: Introducing DARLA
Xu	2022	English speech recognition and evaluation of pronunciation quality using deep learning
Ruede et al.	2019	Yeah, right, uh-huh: a deep learning backchannel predictor
Franke et al.	2016	Phoneme boundary detection using deep bidirectional LSTMs
Korzekwa et al.	2022	Computer-assisted pronunciation training—Speech synthesis is almost all you need
US10283142B1 (en)	2019-05-07	Processor-implemented systems and methods for determining sound quality
Qian et al.	2018	A prompt-aware neural network approach to content-based scoring of non-native spontaneous speech
Wang	2021	Detecting pronunciation errors in spoken English tests based on multifeature fusion algorithm
CN117711444B (en)	2024-04-23	Interaction method, device, equipment and storage medium based on talent expression
Liu et al.	2022	AI recognition method of pronunciation errors in oral English speech with the help of big data for personalized learning
Han et al.	2022	[Retracted] The Modular Design of an English Pronunciation Level Evaluation System Based on Machine Learning
Radha et al.	2024	Speech and speaker recognition using raw waveform modeling for adult and children’s speech: A comprehensive review
Hou et al.	2019	Domain adversarial training for improving keyword spotting performance of ESL speech
Hasan et al.	2022	Effect of vocal tract dynamics on neural network‐based speech recognition: A Bengali language‐based study
Zhou et al.	2018	Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis
Gosztolya et al.	2019	Differentiating laughter types via HMM/DNN and probabilistic sampling
Yuan et al.	2023	The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN
Bao et al.	2022	[Retracted] An Auxiliary Teaching System for Spoken English Based on Speech Recognition Technology
Kothalkar et al.	2024	Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation
Zheng	2022	[Retracted] An Analysis and Research on Chinese College Students’ Psychological Barriers in Oral English Output from a Cross‐Cultural Perspective
Bovbjerg et al.	2024	Self-Supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions
Wang	2022	English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
Zhang	2022	[Retracted] English Speech Recognition System Model Based on Computer‐Aided Function and Neural Network Algorithm
Bartelds et al.	2021	Measuring foreign accent strength using an acoustic distance measure