Shahamiri, 2022 - Google Patents

Neural network-based multi-view enhanced multi-learner active learning: theory and experiments

Document ID: 12473260600622029887
Author: Shahamiri S
Publication year: 2022
Publication venue: Journal of Experimental & Theoretical Artificial Intelligence

Snippet

As applications of neural networks increase in our daily lives, their practicality and accuracy become more of a challenge as they are applied to approximate more complicated functions typically composed of different dependent or independent views. While the complexity of the …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions

Similar Documents

Publication	Publication Date	Title
US10559225B1 (en)	2020-02-11	Computer-implemented systems and methods for automatically generating an assessment of oral recitations of assessment items
Gharavian et al.	2012	Speech emotion recognition using FCBF feature selection method and GA-optimized fuzzy ARTMAP neural network
Sharma et al.	2013	Acoustic model adaptation using in-domain background models for dysarthric speech recognition
CN111081230B (en)	2024-09-17	Speech recognition method and device
US20060206333A1 (en)	2006-09-14	Speaker-dependent dialog adaptation
Jacob	2017	Modelling speech emotion recognition using logistic regression and decision trees
Mallouh et al.	2018	New transformed features generated by deep bottleneck extractor and a GMM–UBM classifier for speaker age and gender classification
Shahamiri	2022	Neural network-based multi-view enhanced multi-learner active learning: theory and experiments
Khanam et al.	2022	Text to speech synthesis: A systematic review, deep learning based architecture and future research direction
Yücesoy et al.	2016	A new approach with score-level fusion for the classification of a speaker age and gender
Cardona et al.	2017	Online phoneme recognition using multi-layer perceptron networks combined with recurrent non-linear autoregressive neural networks with exogenous inputs
Maiti et al.	2020	Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
Ismail et al.	2021	Development of a regional voice dataset and speaker classification based on machine learning
CN115392263A (en)	2022-11-25	Knowledge selection-based dialogue model and training method thereof
Li et al.	2014	Cost‐Sensitive Learning for Emotion Robust Speaker Recognition
Gao et al.	2022	Seamless equal accuracy ratio for inclusive CTC speech recognition
Ons et al.	2014	Fast vocabulary acquisition in an NMF-based self-learning vocal user interface
Wang	2021	Detecting pronunciation errors in spoken English tests based on multifeature fusion algorithm
Silva et al.	2014	Intelligent genetic fuzzy inference system for speech recognition: An approach from low order feature based on discrete cosine transform
Orosoo et al.	2025	Transforming English language learning: Advanced speech recognition with MLP-LSTM for personalized education
CN118053420A (en)	2024-05-17	Speech recognition method, apparatus, device, medium and program product
CN115132195B (en)	2024-03-12	Voice wakeup method, device, equipment, storage medium and program product
Oliveira et al.	2022	A two-level item response theory model to evaluate speech synthesis and recognition
Garcia-Romero et al.	2012	The UMD-JHU 2011 speaker recognition system
Williams	2008	Evaluating user simulations with the Cramér–von Mises divergence