Prapcoyo et al., 2019 - Google Patents

Implementation of Mel Frequency Cepstral Coefficient and Dynamic Time Warping For Bird Sound Classification

Prapcoyo et al., 2019

Document ID: 12475577532751122228
Author: Prapcoyo H; Putra B; Perwira R
Publication year: 2019
Publication venue: Conference SENATIK

External Links

Cited by

Snippet

Lovebird (Agapornis) is a type of bird that has become the belle of new pet birds lately. The interest of the hobbyist in this one song is because Lovebird has a unique chirp. For beginner lovebird fans, the lack of knowledge and experience about lovebird birds results in …

Continue reading at www.academia.edu (PDF) (other versions)

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/005—Speaker recognisers specially adapted for particular applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters

Similar Documents

Publication	Publication Date	Title
Permanasari et al.	2019	Speech recognition using dynamic time warping (DTW)
Singh et al.	2019	Statistical Analysis of Lower and Raised Pitch Voice Signal and Its Efficiency Calculation.
CN112259105B (en)	2022-09-20	Training method of voiceprint recognition model, storage medium and computer equipment
CN101923855A (en)	2010-12-22	Test-irrelevant voice print identifying system
CN112259104B (en)	2022-11-01	Training device for voiceprint recognition model
JP6908045B2 (en)	2021-07-21	Speech processing equipment, audio processing methods, and programs
CN108899033B (en)	2021-09-10	Method and device for determining speaker characteristics
CN103810996A (en)	2014-05-21	Processing method, device and system for voice to be tested
Drygajlo	2012	Automatic speaker recognition for forensic case assessment and interpretation
CN102723079A (en)	2012-10-10	Music and chord automatic identification method based on sparse representation
CN112632318A (en)	2021-04-09	Audio recommendation method, device and system and storage medium
da Silva et al.	2021	Evaluation of a sliding window mechanism as DataAugmentation over emotion detection on speech
CN108829739A (en)	2018-11-16	A kind of information-pushing method and device
Prapcoyo et al.	2019	Implementation of Mel Frequency Cepstral Coefficient and Dynamic Time Warping For Bird Sound Classification
KR20210071713A (en)	2021-06-16	Speech Skill Feedback System
CN114220419A (en)	2022-03-22	Voice evaluation method, device, medium and equipment
CN111859008A (en)	2020-10-30	Music recommending method and terminal
CN105895079A (en)	2016-08-24	Voice data processing method and device
KR102113879B1 (en)	2020-05-26	The method and apparatus for recognizing speaker's voice by using reference database
CN111091810A (en)	2020-05-01	VR game character expression control method based on voice information and storage medium
Zhang et al.	2017	Articulatory movement features for short-duration text-dependent speaker verification
Unnikrishnan et al.	2017	Mimicking voice recognition using MFCC-GMM framework
CN110931020B (en)	2022-05-24	Voice detection method and device
US7454337B1 (en)	2008-11-18	Method of modeling single data class from multi-class data
Gomes et al.	2021	Person identification based on voice recognition