Nothing Special   »   [go: up one dir, main page]

Skip to main content

Recognition of Greek Phonemes Using Support Vector Machines

  • Conference paper
Advances in Artificial Intelligence (SETN 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3955))

Included in the following conference series:

Abstract

In the present work we study the applicability of Support Vector Machines (SVMs) on the phoneme recognition task. Specifically, the Least Squares version of the algorithm (LS-SVM) is employed in recognition of the Greek phonemes in the framework of telephone-driven voice-enabled information service. The N-best candidate phonemes are identified and consequently feed to the speech and language recognition components. In a comparative evaluation of various classification methods, the SVM-based phoneme recognizer demonstrated a superior performance. Recognition rate of 74.2% was achieved from the N-best list, for N=5, prior to applying the language model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Zissman, M.: Comparison of four Approaches to Automatic Language Identification of Telephone Speech. IEEE Trans. Speech and Audio Proc. 4, 31–44 (1996)

    Article  Google Scholar 

  2. Mak, M.: Combining ANNs to improve phone recognition. In: IEEE ICASSP 1997, Munich, Germany, vol. 4, pp. 3253–3256 (1997)

    Google Scholar 

  3. Kwon, O.-W., Lee, T.-W.: Phoneme recognition using ICA-based feature extraction and transformation. Signal Processing (June 2004)

    Google Scholar 

  4. Caseiro, D., Trancoso, I.: Identification of Spoken European Languages. In: Eusipco, IX European Signal Processing Conference, Greece (September 1998)

    Google Scholar 

  5. Yan, Y., Barnard, E.: Experiments for an approach to Language Identification with conversational telephone speech. In: ICASSP, Atlanta, USA, May 1996, vol. 1, pp. 789–792 (1996)

    Google Scholar 

  6. Lee, K.-F., Hon, H.-W.: Speaker Independent Phone Recognition using HMM. IEEE Trans. on Acoustics Speech and Audio Processing 37(11) (November 1989)

    Google Scholar 

  7. Schultz, T., Waibel, A.: Language Independent and Language Adaptive Acoustic Modeling for Speech Recognition. Speech Communication 35(1-2), 31–51 (2001)

    Article  MATH  Google Scholar 

  8. Dalsgaard, P., Andersen, O., Hesselager, H., Petek, B.: Language Identification using Language-dependent phonemes and Language-independent speech units. In: ICSLP 1996, pp. 1808–1811 (1996)

    Google Scholar 

  9. Corredor-Ardoy, C., Gauvain, J., Adda_decker, M., Lamel, L.: Language Identification with Language-independent acoustic models. In: Proc. of EUROSPEECH 1997 (September 1997)

    Google Scholar 

  10. Martin, T., Wong, E., Baker, B., Mason, M.: Pitch and Energy Trajectory Modeling in a Syllable Length Temporal Framework for Language Identification. In: ODYSSEY 2004, Toledo, Spain, May 31-June 3 (2004)

    Google Scholar 

  11. Pusateri, E., Thong, J.M.: N-best List Generation using Word and Phoneme Recognition Fusion. In: 7th European Conference on Speech Communication and Technology (EuroSpeech), Aalborg, Denmark (September 2001)

    Google Scholar 

  12. Salomon, J., King, S., Osborne, M.: Framewise phone classification using support vector machines. In: Proceedings International Conference on Spoken Language Processing, Denver (2002)

    Google Scholar 

  13. Garofolo, J.: Getting started with the DARPA-TIMIT CD-ROM: An acoustic phonetic continuous speech database. National Institute of Standards and Technology (NIST), Gaithersburgh, MD, USA (1988)

    Google Scholar 

  14. Friedman, J.: Another approach to polychotomous classification. Technical report, Stanford University, UA (1996)

    Google Scholar 

  15. Suykens, J., Vandewalle, J.: Least Squares Support Vector Machine Classifiers. Neural Processing Letters 9(3), 293–300 (1999)

    Article  MATH  Google Scholar 

  16. Hodge, H.: SpeechDat multilingual speech databases for teleservices: across the finish line. In: EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999, vol. 6, pp. 2699–2702 (1999)

    Google Scholar 

  17. Chatzi, I., Fakotakis, N., Kokkinakis, G.: Greek speech database for creation of voice driven teleservices. In: EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997, vol. 4, pp. 1755–1758 (1997)

    Google Scholar 

  18. John, G., Langley, P.: Estimating Continuous Distributions in Bayesian Classifiers. In: 11th Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann, San Mateo (1995)

    Google Scholar 

  19. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)

    Google Scholar 

  20. Quinlan, J.R.: Bagging, Boosting, and C4.5, vol. 1. AAAI/IAAI (1996)

    Google Scholar 

  21. Witten, I., Frank, E.: Data Mining: Practical machine learning tools with Java implementations. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  22. Burges, C.: A tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mporas, I., Ganchev, T., Zervas, P., Fakotakis, N. (2006). Recognition of Greek Phonemes Using Support Vector Machines. In: Antoniou, G., Potamias, G., Spyropoulos, C., Plexousakis, D. (eds) Advances in Artificial Intelligence. SETN 2006. Lecture Notes in Computer Science(), vol 3955. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752912_30

Download citation

  • DOI: https://doi.org/10.1007/11752912_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34117-8

  • Online ISBN: 978-3-540-34118-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics