A New Step in Arabic Speech Identification: Spoken Digit Recognition

Khalid Saeed³ &
Mohammad K. Nammous³

Abstract

This work presents a new Algorithm to recognize separate voices of some Arabic words, the digits form zero to ten. Firstly we prepare our signal by pre-processing trial. Next the speech signal is processed as an image by Power Spectrum Estimation. For feature extraction, transformation and hence recognition, the algorithm of minimal eigenvalues of Toeplitz matrices together with other methods of speech processing and recognition are used. At the stage of classification many methods are tested from classical ones, which depend on the matrix theory, to different types of neuron networks, mainly radial basis functions neural networks. The success rate obtained in the presented experiments is almost ideal and exceeded 98% for many cases. The results have shown flexibility to extend the algorithm to speaker identification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

6 References

K. Saeed, M. Nammous, “Experimental Image-Based Algorithm for Spoken Arabic Digits Identification,” Computer Information Systems and Applications, Vol.1, pp.55–66, WSFiZ Press, Bialystok, Poland 2004.
Google Scholar
K. Saeed, “Computer Graphics Analysis: A Criterion for Image Feature Extraction and Recognition,” Vol. 10, Issue 2, 2001, pp. 185–194, MGV-International Journal on Machine Graphics and Vision, Institute of Computer Science, Polish Academy of Sciences, Warsaw.
Google Scholar
R. W. Schafer, L. R. Rabiner, “System for Automatic Formant Analysis of Voiced Speech,” /J. Acoust. Soc. Amer. Vol.47, Feb. 1970.
Google Scholar
Andreas A., “Digital Filters: Analysis and Design,” McGraw-Hill, New York 1979.
Google Scholar
Cz. Basztura, “Modele analizy i procedury w komputerowym rozpoznawaniu głosów,” (in Polish), prace naukowe ITiA Politechniki Wrocławskiej, no. 30, Wrocław 1989.
Google Scholar
L. S. Marple, “Digital Spectral Analysis,” Englewood Cliffs, NJ: Prentice Hall, 1987.
Google Scholar
Sadaoki Furui, “Digital Speech Processing, Synthesis, and Recognition,” Marcel Dekker, Inc. 2001.
Google Scholar
R. Tadeusiewicz, “Sygnał mowy,” WKiŁ (in Polish), Warsaw 1988.
Google Scholar
V. K. Ingle, J. G. Proakis, “Digital Signal Processing Using MATLAB,” Brooks Cole, July 1999.
Google Scholar
K. Saeed, M. Kozłowski, A. Kaczanowski, “Metoda do rozpoznawania obrazów akustycznych izolowanych liter mowy”, Zeszyty Politechniki Białostockiej (in Polish), I-1/2002, pp. 181–207, Bialystok 2002.
Google Scholar
K. Saeed, M. Kozłowski, “An Image-Based System for Spoken-Letter Recognition,” 10th Int. Conference CAIP'03 on Computer Analysis of Images and Patterns, August 2003, Groningen. Proceedings published in: Lecture Notes in Computer Science, Petkov and Westenberg (Eds.), pp. 494–502, LNCS 2756, Springer-Verlag Heidelberg: Berlin 2003.
Google Scholar
K. Saeed, M. Tabedzki, “A New Hybrid System for Recognition of Handwritten-Script,” Invited for publication in International Scientific Journal of Computing, Institute of Computer Information Technologies, Volume 3, Issue 1, pp. 50–57, 2004, Ternopil, Ukraine 2004.
Google Scholar
Shigeru Katagiri, “Handbook of Neural Networks for Speech Processing,” Artech House, Boston 2000.
Google Scholar
M.W. Mak, W.G. Allen and G.G. Sexton, “Speaker identification using radial basis functions”, The Third International Conference on Artifical Neural networks, University of Northumbria at Newcastle, U.K 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computer Science, Bialystok Technical University, Wiejska 45A, 15-351, Bialystok, Poland
Khalid Saeed & Mohammad K. Nammous

Authors

Khalid Saeed
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad K. Nammous
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Białystock Technical University, Poland
Khalid Saeed
Technical University of Szczecin, Poland
Jerzy Pejaś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saeed, K., Nammous, M.K. (2005). A New Step in Arabic Speech Identification: Spoken Digit Recognition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_6

Download citation

DOI: https://doi.org/10.1007/0-387-26325-X_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25091-5
Online ISBN: 978-0-387-26325-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics