Abstract
This work presents a new Algorithm to recognize separate voices of some Arabic words, the digits form zero to ten. Firstly we prepare our signal by pre-processing trial. Next the speech signal is processed as an image by Power Spectrum Estimation. For feature extraction, transformation and hence recognition, the algorithm of minimal eigenvalues of Toeplitz matrices together with other methods of speech processing and recognition are used. At the stage of classification many methods are tested from classical ones, which depend on the matrix theory, to different types of neuron networks, mainly radial basis functions neural networks. The success rate obtained in the presented experiments is almost ideal and exceeded 98% for many cases. The results have shown flexibility to extend the algorithm to speaker identification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
6 References
K. Saeed, M. Nammous, “Experimental Image-Based Algorithm for Spoken Arabic Digits Identification,” Computer Information Systems and Applications, Vol.1, pp.55–66, WSFiZ Press, Bialystok, Poland 2004.
K. Saeed, “Computer Graphics Analysis: A Criterion for Image Feature Extraction and Recognition,” Vol. 10, Issue 2, 2001, pp. 185–194, MGV-International Journal on Machine Graphics and Vision, Institute of Computer Science, Polish Academy of Sciences, Warsaw.
R. W. Schafer, L. R. Rabiner, “System for Automatic Formant Analysis of Voiced Speech,” /J. Acoust. Soc. Amer. Vol.47, Feb. 1970.
Andreas A., “Digital Filters: Analysis and Design,” McGraw-Hill, New York 1979.
Cz. Basztura, “Modele analizy i procedury w komputerowym rozpoznawaniu głosów,” (in Polish), prace naukowe ITiA Politechniki Wrocławskiej, no. 30, Wrocław 1989.
L. S. Marple, “Digital Spectral Analysis,” Englewood Cliffs, NJ: Prentice Hall, 1987.
Sadaoki Furui, “Digital Speech Processing, Synthesis, and Recognition,” Marcel Dekker, Inc. 2001.
R. Tadeusiewicz, “Sygnał mowy,” WKiŁ (in Polish), Warsaw 1988.
V. K. Ingle, J. G. Proakis, “Digital Signal Processing Using MATLAB,” Brooks Cole, July 1999.
K. Saeed, M. Kozłowski, A. Kaczanowski, “Metoda do rozpoznawania obrazów akustycznych izolowanych liter mowy”, Zeszyty Politechniki Białostockiej (in Polish), I-1/2002, pp. 181–207, Bialystok 2002.
K. Saeed, M. Kozłowski, “An Image-Based System for Spoken-Letter Recognition,” 10th Int. Conference CAIP'03 on Computer Analysis of Images and Patterns, August 2003, Groningen. Proceedings published in: Lecture Notes in Computer Science, Petkov and Westenberg (Eds.), pp. 494–502, LNCS 2756, Springer-Verlag Heidelberg: Berlin 2003.
K. Saeed, M. Tabedzki, “A New Hybrid System for Recognition of Handwritten-Script,” Invited for publication in International Scientific Journal of Computing, Institute of Computer Information Technologies, Volume 3, Issue 1, pp. 50–57, 2004, Ternopil, Ukraine 2004.
Shigeru Katagiri, “Handbook of Neural Networks for Speech Processing,” Artech House, Boston 2000.
M.W. Mak, W.G. Allen and G.G. Sexton, “Speaker identification using radial basis functions”, The Third International Conference on Artifical Neural networks, University of Northumbria at Newcastle, U.K 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this paper
Cite this paper
Saeed, K., Nammous, M.K. (2005). A New Step in Arabic Speech Identification: Spoken Digit Recognition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_6
Download citation
DOI: https://doi.org/10.1007/0-387-26325-X_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25091-5
Online ISBN: 978-0-387-26325-0
eBook Packages: Computer ScienceComputer Science (R0)