An SVM-Based Mandarin Pronunciation Quality Assessment System

Fengpei Ge⁴,
Fuping Pan⁴,
Changliang Liu⁴,
Bin Dong⁴,
Shui-duen Chan⁵,
Xinhua Zhu⁵ &
…
Yonghong Yan⁴

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 56))

1460 Accesses
6 Citations

Abstract

This paper presents our Mandarin pronunciation quality assessment system for the examination of Putonghua Shuiping Kaoshi (PSK) and investigates a novel Support Vector Machine (SVM) based method to improve its assessment accuracy. Firstly, an selective speaker adaptation module is introduced, in which we select well pronounced speech from results of the first-pass automatic pronunciation scoring as the adaptation data, and adopt Maximum Likelihood Linear Regression to update the acoustic model (AM). Then, compared with the traditional triphone based AM, the monophone based AM is studied. Finally, we propose a new method of incorporating all kinds of posterior probabilities using SVM classifier. Experimental results show that the average correlation coefficient between machine and human scores is improved from 83.72% to 85.48%. It suggests that the two methods of selective speaker adaptation and multi-model combination using SVM are very effective to improve the accuracy of pronunciation quality assessment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enhancing Automatic Speech Recognition for Punjabi Dialects: An Experimental Analysis of Incorporating Prosodic Features and Acoustic Variability Mitigation

Article 01 August 2024

A Text-Independent Method for Estimating Pronunciation Quality of Chinese Students

Research on Spoken English Speech Recognition Technology Based on MATLAB

References

Neumeyer, L., Franco, H., Weintraub, M., Price, P.: Automatic Text-Independent Pronunciation Scoring of Foreign Language Student Speech. In: Proc. of ICSLP 1996, Philadelphia, Pennsylvania, pp. 1457–1460 (1996)
Google Scholar
Tatsuya, K., Masatake, D., Yasushi, T.: Practical Use of English Pronunciation System for Japanese Students in the CALL Classroom. In: INTERSPEECH 2004, pp. 1689–1692 (2004)
Google Scholar
Franco, H., Neumeyer, L., Kim, Y., Ronen, O.: Automatic Pronunciation Scoring for Language Instruction. In: Proc. Int’l. Conf. on Acoust., Speech and Signal Processing, Munich, pp. 1471–1474 (1997)
Google Scholar
Neumeyer, L., Franco, H., Digalakis, V., Weintraub, M.: Automatic Scoring of Pronunciation Quality. Speech Communication 30(2-3), 83–93 (2000)
Article Google Scholar
Franco, H., Neumeyer, L., Digalakis, V., Ronen, O.: Combination of Machine Scores for Automatic Grading of Pronunciation Quality. Speech Communication 30 (2000)
Google Scholar
Witt, S.M., Young, S.J.: Phone-level Pronunciation Scoring and Assessment for Interactive Language Learning. Speech communication 30(2)-32(3), 95–108 (2000)
Article Google Scholar
Bernstein, J., Cohen, M., Murveit, H., Rtischev, D., Weintraub, M.: Automatic Evaluation and Training in English Pronunciation. In: ICSLP Kobe, Japan (1990)
Google Scholar
Chen, J.C., Jang, J.S.R., Li, J.Y., Wu, M.J.: Automatic Pronunciation Assessment for Mandarin Chinese. In: IEEE International Conference on Multimedia and Expo., Taipei, Taiwan (June 2004)
Google Scholar
Pan, F.P., Zhao, Q.W., Yan, Y.H.: Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech. In: Proceedings of ISCSLP 2006, pp. 592–602 (2006)
Google Scholar
Leggetter, C., Woodland, P.: Speaker adaptation of HMMs using linear regression. Technical Report CUED/F-INFENG/TR. 181. Cambridge University Engineering Department, Cambridge, UK (1994)
Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
MATH Google Scholar
http://www.csie.ntu.edu.tw/~cjlin/libsvm/
http://ntu.csie.org/~piaip/svm/svm_tutorial.html
Implementation Outline for Putonghua Shuiping Kaoshi. Commercial Press, Beijing (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100871, P.R. China
Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong & Yonghong Yan
Department of Chinese & Bilingual Studies, the Hong Kong Polytechnic University,
Shui-duen Chan & Xinhua Zhu

Authors

Fengpei Ge
View author publications
You can also search for this author in PubMed Google Scholar
Fuping Pan
View author publications
You can also search for this author in PubMed Google Scholar
Changliang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Bin Dong
View author publications
You can also search for this author in PubMed Google Scholar
Shui-duen Chan
View author publications
You can also search for this author in PubMed Google Scholar
Xinhua Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yonghong Yan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Control Science and Engineering, Huazhong University of Science and Technology, No. 1037, Luoyu Road, 430074, Wuhan, Hubei, China
Hongwei Wang , Yi Shen & Zhigang Zeng , &
Texas A&M University at Qatar, PO Box 23874, Doha, Qatar,
Tingwen Huang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ge, F. et al. (2009). An SVM-Based Mandarin Pronunciation Quality Assessment System. In: Wang, H., Shen, Y., Huang, T., Zeng, Z. (eds) The Sixth International Symposium on Neural Networks (ISNN 2009). Advances in Intelligent and Soft Computing, vol 56. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01216-7_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-01216-7_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01215-0
Online ISBN: 978-3-642-01216-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

An SVM-Based Mandarin Pronunciation Quality Assessment System

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Automatic Speech Recognition for Punjabi Dialects: An Experimental Analysis of Incorporating Prosodic Features and Acoustic Variability Mitigation

A Text-Independent Method for Estimating Pronunciation Quality of Chinese Students

Research on Spoken English Speech Recognition Technology Based on MATLAB

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

An SVM-Based Mandarin Pronunciation Quality Assessment System

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Automatic Speech Recognition for Punjabi Dialects: An Experimental Analysis of Incorporating Prosodic Features and Acoustic Variability Mitigation

A Text-Independent Method for Estimating Pronunciation Quality of Chinese Students

Research on Spoken English Speech Recognition Technology Based on MATLAB

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation