Efficient speaker identification from speech transmitted over Bluetooth networks

Ali A. Khalil¹,
Mustafa M. Abd Elnaby¹,
Elsayed M. Saad²,
Azzam Y. Al-nahari³,
Nayel Al-Zubi⁴,
Mohsen A. M. El-Bendary⁵ &
…
Fathi E. Abd El-Samie⁶

368 Accesses
2 Citations
Explore all metrics

Abstract

This paper studies the process of speaker identification over Bluetooth networks. Bluetooth channel degradations are considered prior to the speaker identification process. The work in this paper employs Mel-frequency cepstral coefficients for feature extraction. Features are extracted from different transforms of the received speech signals such as the discrete cosine transform (DCT), signal plus DCT, discrete sine transform (DST), signal plus DST, discrete wavelet transform (DWT), and signal plus DWT. A neural network classifier is used in the experiments, while the training phase uses clean speech signals and the testing phase uses degraded signals due to communication over the Bluetooth channel. A comparison is carried out between the different methods of feature extraction showing that the DCT achieves the highest recognition rates.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Speaker Identification in a Multi-speaker Environment

Speaker Identification Using MFCC Feature Extraction ANN Classification Technique

Article 01 May 2024

Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients

Article 17 September 2018

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Al Bawab, Z., et al. (2003). Speech recognition over bluetooth wireless channels. EUROSPEECH 2003, Geneva.
Chadha, A. (2011). Text-independent speaker recognition for low SNR environments with encryption. International Journal of Computer Applications, 31(10) (0975–8887), October 2011.
Chavan, M. S., & Chougule, S. V. (2012). Speaker features and recognition techniques: A review. International Journal of Computational Engineering Research. ISSN: 2250–3005, IJCER, May–June 2012, Vol. 2, Issue No. 3, 720–728.
El-Bendary, M. A. M. M. El-azm, A. E. A., El-Fishawy, N. A., Shawki, F., Abd-ElSamie, F. E., El-Tokhy, M. A. R., et al. (2012). Performance of the audio signals transmission over wireless networks with the channel interleaving considerations. EURASIP Journal on Audio, Speech, and Music Processing, 4.
Galushkin, A. I. (2007). Neural network theory. Berlin, Heidelberg: Springer.
Google Scholar
Han, W. (2006). Speech recognition IC with an efficient MFCC features. The Chinese University of Hong Kong, Sept. 2006.
Haykin, S. (1999). Neural networks, 2nd ed. McMaster University, Hamilton, ON, Canada.
Kinnunen, T. (2003). Spectral Features for automatic text-independent speaker recognition. University of Joensuu, Department of Computer Science, Joenssuu, Finland.
Pullella, D., & Togneri, R. (2006). Speaker identification using higher order spectra. University of Western Australia.
Russo, M. (2005). Speech recognition over Bluetooth ACL and SCO links: A comparison. IEEE.
Shuling, L., & Wang, C. (2009) Nonspecific speech recognition method based on composite LVQ1 and LVQ2 network. Chinese Control and Decision Conference (CCDC), 2304–2388.
Trivedi, N., Kumar, V., Singh, S., Ahuja, S., & Chadha, R. (2011). Speech recognition by wavelet analysis. International Journal of Computer Applications, 15(8) (0975–8887) February 2011.

Download references

Author information

Authors and Affiliations

Department of Electronics and Communications Engineering, Tanta University, Tanta, Egypt
Ali A. Khalil & Mustafa M. Abd Elnaby
Department of Communications and Electronics, Helwan University, Helwan, 11795, Egypt
Elsayed M. Saad
Department of Electronics Engineering, Faculty of Engineering and Architecture, Ibb University, Ibb, Yemen
Azzam Y. Al-nahari
Department of Computer Engineering, Faculty of Engineering, Al-Balqa’ Applied University, Al-Salt , 19117, Jodan
Nayel Al-Zubi
Faculty of Industrial Education, Helwan University, Helwan, Egypt
Mohsen A. M. El-Bendary
Department of Electronics and Electrical Communications, Faculty of Electronic Engineering, Menoufia University, Menouf , 32952, Egypt
Fathi E. Abd El-Samie

Authors

Ali A. Khalil
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa M. Abd Elnaby
View author publications
You can also search for this author in PubMed Google Scholar
Elsayed M. Saad
View author publications
You can also search for this author in PubMed Google Scholar
Azzam Y. Al-nahari
View author publications
You can also search for this author in PubMed Google Scholar
Nayel Al-Zubi
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen A. M. El-Bendary
View author publications
You can also search for this author in PubMed Google Scholar
Fathi E. Abd El-Samie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fathi E. Abd El-Samie.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Khalil, A.A., Elnaby, M.M.A., Saad, E.M. et al. Efficient speaker identification from speech transmitted over Bluetooth networks. Int J Speech Technol 17, 409–416 (2014). https://doi.org/10.1007/s10772-014-9238-4

Download citation

Received: 22 July 2013
Accepted: 22 May 2014
Published: 19 June 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s10772-014-9238-4

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Speaker Identification in a Multi-speaker Environment

Speaker Identification Using MFCC Feature Extraction ANN Classification Technique

Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Efficient speaker identification from speech transmitted over Bluetooth networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Speaker Identification in a Multi-speaker Environment

Speaker Identification Using MFCC Feature Extraction ANN Classification Technique

Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation