Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2905055.2905139acmotherconferencesArticle/Chapter ViewAbstractPublication PagesictcsConference Proceedingsconference-collections
research-article

Speaker Identification and Authentication System using Energy based Cepstral Data Technique

Published: 04 March 2016 Publication History

Abstract

This paper aims to design a speaker identification system using data optimization technique. Speaker identification is an important task for authentication and verification in a system. In this paper a technique of text-dependent speaker identification using energy optimized Mel Frequency Cepstral Coefficients (MFCC) features from the speech signal are derived. At the stage of classification, a distance measure technique is used. Energy of each frame in the training and test files are calculated. Energy frames with respect to the pitch of each speaker are retained and the rest are pruned from the original speech training files. In this way, frames that are only helpful in identifying the speaker is kept for distance measure. Results are evaluated using MFCC-13 and MFCC-39 coefficients. Euclidean distance metric is used to find the speaker from the existing training files.

References

[1]
L.R. Rabiner and B.H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, N.J.: Englewood Cliffs, 1993.
[2]
Rabiner, L.R., A tutorial on Hidden Markov Models, Proceeding of IEEE, Vol. 73, pp. 1349--1387, 1989.
[3]
Rabiner L.R., "A Tutorial on Hidden Markov Models and selected Applications in Speech Recognition", Proc, IEEE, Vol 77, 1989, pp 257--286
[4]
Fu Zhonghua; Zhao Rongchun; "An overview of modeling technology of speaker recognition", IEEE Proceedings of the International Conference on Neural Networks and Signal Processing Volume 2, Page(s):887--891, Dec. 2003
[5]
Samudravijaya K, "Speech and Speaker Recognition: A Tutorial", Tata Institute of Fundamental Research
[6]
http://www.fon.hum.uva.nl/praat/ used for speech recording
[7]
audacity.sourceforge.net
[8]
S. Furui {1986}, "Speaker-independent isolated word recognition using dynamic features of speech spectrum", IEEE Transactions on Acoustic, Speech, Signal Processing, Vol. 34, No. 1, pp. 52--59
[9]
Zheng F., Zhang, G., Song, Z., "Comparison of different imple-mentations of MFCC", J. Computer Science & Technology, 16(6):582--589, Sept. 2001
[10]
U. Shrawankar and V. M. Thakare, "Techniques for feature extraction in speech recognition system: A comparative study," arXiv preprint arXiv:1305.1145, 2013.
[11]
J. Li, et al., "An overview of noise-robust automatic speech recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, pp. 745--777, 2014.
[12]
C. Kim and R. M. Stern, "Power-normalized cepstral coefficients (PNCC) for robust speech recognition," in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 4101--4104.
[13]
B. Gao and W. L. Woo, "Wearable Audio Monitoring: Content-Based Processing Methodology and Implementation," IEEE Transactions on Human-Machine Systems, vol. 44, pp. 222--233, 2014.
[14]
The Mathworks -- MATLAB and SIMULINK for Technical Computing. 10 June 2005. http://www.mathworks.com

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICTCS '16: Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies
March 2016
843 pages
ISBN:9781450339629
DOI:10.1145/2905055
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 March 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Euclidean Distance
  2. Log Energy
  3. MFCC
  4. Speaker identification

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICTCS '16

Acceptance Rates

Overall Acceptance Rate 97 of 270 submissions, 36%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 48
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media