
Distributed Speech Recognition of Mandarin Digits String

  • Conference paper
Chinese Spoken Language Processing (ISCSLP 2006)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4274)


Abstract

In this paper, the performance of the pitch detection algorithm in the ETSI ES-202-212 XAFE standard is evaluated on a Mandarin digit string recognition task. Experimental results show that the pitch detection algorithm degrades seriously when the SNR of the speech signal is below 10 dB. As a result, in low-SNR environments the recognizer that uses pitch information performs worse than the original recognizer that does not. A modification of the pitch detection algorithm is therefore proposed to improve pitch detection in low-SNR environments. Recognition performance can be improved at most SNR levels by integrating the recognizers with and without pitch information. Overall recognition rates of 82.1% and 86.8% were achieved for the clean and multi-condition training cases, respectively.
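To make the 10 dB threshold concrete, the following is a minimal sketch of how a noisy test signal at a chosen SNR can be constructed. It is not the procedure used in the paper: the function names `mix_at_snr` and `measured_snr_db`, the 8 kHz sampling rate, the 200 Hz tone standing in for voiced speech, and the Gaussian noise are all assumptions made for illustration.

```python
import math
import random


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`, then add it."""
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if speech_power == 0 or noise_power == 0:
        raise ValueError("signals must have non-zero power")
    # SNR(dB) = 10 * log10(P_speech / (gain^2 * P_noise)), solved for gain.
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(speech, noisy):
    """Recover the SNR in dB from the clean signal and the noisy mixture."""
    speech_power = sum(s * s for s in speech)
    residual_power = sum((y - s) ** 2 for s, y in zip(speech, noisy))
    return 10 * math.log10(speech_power / residual_power)


rng = random.Random(0)
# Assumption: a 200 Hz tone sampled at 8 kHz stands in for a voiced segment.
speech = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]
noisy = mix_at_snr(speech, noise, snr_db=10)
print(round(measured_snr_db(speech, noisy), 2))
```

The printed value should be very close to the requested 10 dB. Lowering `snr_db` increases the noise gain, which is the regime in which the abstract reports degraded pitch detection.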


References

  1. Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Extended advanced front-end feature extraction algorithm; Compression algorithms; Back-end reconstruction algorithm, ETSI Standard ES 202 212 (November 2003)

  2. DSR Front-end Extension for Tonal-language Recognition and Speech Reconstruction. Aurora Group Meeting, by IBM & Motorola (April 2003), http://portal.etsi.org/stq/DSR_Presentations/Presentation.pps

  3. Lin, W.-y., Lee, L.-S.: Improved Tone Recognition for Fluent Mandarin Speech Based on New Inter-Syllabic Features and Robust Pitch Extraction. In: IEEE 8th Automatic Speech Recognition and Understanding Workshop, St. Thomas, US Virgin Islands, USA, December 2003, pp. 237–242 (2003)

  4. AURORA Database, http://www.elda.org/article20.html

  5. Test and Processing plan for default codec evaluation for speech enabled services (SES), Tdoc S4-030395, 3GPP TSG SA4 meeting #26, Paris, France (May 5-9, 2003)

  6. Lyu, D.-C., Liang, M.-S., Chiang, Y.-C., Hsu, C.-N., Lyu, R.-Y.: Large Vocabulary Taiwanese (Min-nan) Speech Recognition Using Tone Features and Statistical Pronunciation Modeling. In: Eurospeech 2003, Geneva, pp. 1861–1864 (2003)

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, YR., Lu, BX., Liao, YF., Chen, SH. (2006). Distributed Speech Recognition of Mandarin Digits String. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science, vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_40

  • DOI: https://doi.org/10.1007/11939993_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49665-6

  • Online ISBN: 978-3-540-49666-3

  • eBook Packages: Computer Science (R0)