Abstract
A new concept of making the Consonantal Recognition is proposed in this work, where used units (phonemes and syllables) to make the word recognition. This concept was carried out by a hierarchical decision structure, based on the Articulatory Phonetics and SVM. The speech features used were MFCC and WPT. Eighteen consonantal phonemes have been used in the recognition. The database used for the recognition was a set of two-syllable words of the Brazilian Portuguese language. The experimental results showed success rates of 98.41% for the user-dependent case. Our focus was the dependent speaker in order to validate the new proposal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Raman, S., Yegnanarayana, B.: Performance of Isolated Word Recognition System for Confusable Vocabulary. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing, ICASSP 1984, vol. 09, Part 1, pp. 17–20 (1984)
Ruske, G.: Automatic Recognition of Syllabic Speech Segments using Spectral and Temporal Features. In: IEEE, Proceedings ICASSP, vol. 01, pp. 550–553 (1982)
Abdelatty Ali, A.M., Spiegel, J.V.D., Mueller, P., Berman, J.: An Acoustic-phonetic Feature-based System for Automatic Phoneme Recognition in Continuous Speech. In: IEEE Int. Symposium on Circuits and Systems, ISCAS 1999, vol. 03, pp. 118–121 (1999)
de A. Bresolin, A., Neto, A.D.D., Alsina, P.J.: A New Hierarchical Decision Structure Using Wavelet Packet and SVM for Brazilian Phonemes Recognition. In: King, I., Wang, J., Chan, L.-W., Wang, D. (eds.) ICONIP 2006, Part II. LNCS, vol. 4233, pp. 159–166. Springer, Heidelberg (2006)
International Phonetic Association – IPA, http://www2.arts.gla.ac.uk/IPA/ipachart.html
Combrinck, H.P., Botha, E.C.: On The Mel-scaled Cepstrum. In: Proc. of the Seventh Annual South African Workshop on Pattern Recognition, University of Cape Town (1996)
Burrus, C.S., Gopinath, R.A., Guo, H.: Introduction to Wavelets and Wavelet Transforms. Prentice Hall, New Jersey (1998)
Chan, C.P., Ching, P.C., Lee, T.: Noisy Speech Recognition using de-noised Multiresolution Analysis Acoustic Features. Journal Acoustical Society of America 110(05), pt. 01, 2567–2574 (2001)
Vapnik, V.N.: Principles of Risk Minimization for Learning Theory. In: Advances in Neural Information Processing Systems, vol. 04, San Mateo, CA, pp. 831–838 (1992)
Deshmukh, O., Espy-Wilson, C.Y., Juneja, A.: Acoustic Phonetic Speech Parameters for Speaker-Independent Speech Recognition. In: ICASSP 2002 Speech Processing nº 2162 (May 2002)
Malbos, F., Baudry, M., Montresor, S.: Detection of Stop Consonants with the Wavelet Transform. In: Proc. of the IEEE-SP Int. Symposium on Time-Frequency and Time-Scale Analysis, pp. 612–615 (October 1994)
Farooq, O., Datta, S.: Phoneme Recognition using Wavelet based Features. Information Sciences 150(1-2), 5–15 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Andrade Bresolin, A., Del Monego, H.I. (2012). Consonantal Recognition Using SVM and a Hierarchical Decision Structure Based in the Articulatory Phonetics. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7664. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34481-7_80
Download citation
DOI: https://doi.org/10.1007/978-3-642-34481-7_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34480-0
Online ISBN: 978-3-642-34481-7
eBook Packages: Computer ScienceComputer Science (R0)