HTK-based speech recognition and corpus-based English vocabulary online guiding system

Lu Yang¹

109 Accesses
1 Citation
Explore all metrics

Abstract

With the popularization of computers and the development of modern educational technology, the connection between corpus and foreign language intelligent guiding is getting closer and closer. Corpus was first used in vocabulary guiding in foreign language guiding, and there are many research achievements in this field. However, in practical guiding, English vocabulary teaching is a big problem faced by teachers and students. This thesis mainly studies the English vocabulary online instruction system from the perspective of speech recognition. English vocabulary online guidance system has become an essential tool for English learners to learn vocabulary. Speech recognition technology is the technology that converts speech signals into text. Automatic speech recognition is also known as speech recognition or computer speech recognition, its goal is to let the computer can recognize the continuous speech that different people speak, to achieve the conversion of voice to text. Speech recognition is a comprehensive technology that integrates many subjects, including phonetics, linguistics, computer science and so on. Hence, this paper analyzes the HTK speech recognition technology and the construction of the corpus, and studies the English vocabulary online guidance system. The novel speech analysis technology is considered for the implementations of the novel guiding system. Through the comparison simulations compared with the other state-of-the-art systems, the designed outperformed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A System of English Spoken Pronunciation Learning Based on Speech Recognition and Mobile Phone Platform

Automatic Speech Recognition System to Support Aural Vocabulary Learning

Dynamic out-of-vocabulary word registration to language model for speech recognition

Article Open access 25 January 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Asbai, N., & Amrouche, A. (2017). Boosting scores fusion approach using Front-End Diversity and adaboost Algorithm, for speaker verification. Computers and Electrical Engineering, 62, 250–257.
Article Google Scholar
Boxin, J. (2016). The Enlightenment of cognitive linguistics on English Vocabulary Teaching. Overseas English, 320(04), 176–177.
Google Scholar
Chang, X., Zhang, W., Qian, Y., Roux, J. L., & Watanabe, S. (2020). End-to-end multi-speaker speech recognition with transformer. In ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6134–6138). IEEE
Chenchen, G., & Ying, W. (2016). Practical exploration of practical teaching materials construction of online English education. Scientific Guide, 000(009), 5–5.
Google Scholar
Haiyan, Z. (2017). Exploration of network foreign language teaching and education mode innovation. English Square, 079(07), 86–87.
Google Scholar
Hrinchuk, O., Popova, M., & Ginsburg, B. (2020). Correction of automatic speech recognition with transformer sequence-to-sequence model. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 7074–7078). IEEE
Jiying, M. (2017). Problems and countermeasures of middle school English Vocabulary Teaching under the concept of new curriculum. Journal of Prose
Li, M., Zorilă, C., & Doddipatla, R. (2021). Transformer-based online speech recognition with decoder-end adaptive computation steps. In 2021 IEEE spoken language technology workshop (SLT) (pp. 1–7). IEEE
Li, Z. (2016). Speech signal processing. China Machine Press
Moritz, N., Hori, T., & Roux, J. L. (2021). Semi-supervised speech recognition via graph-based temporal classification. In ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6548–6552). IEEE
Ng, E. H., Ning, & Rönnberg, J. (2020). Hearing aid experience and background noise affect the robust relationship between working memory and speech recognition in noise. International Journal of Audiology, 59(3), 208–218.
Article Google Scholar
Ochiai, T., Watanabe, S., Hori, T., & Hershey, J. R. (2017). Multichannel end-to-end speech recognition. . In International conference on machine learning (pp. 2632–2641). PMLR
Okamoto, A. (2017). Speech recognition system, recognition dictionary registration system, and acoustic model identifier series generation apparatus.
Ravanelli, M., Zhong, J., Pascual, S., Swietojanski, P., Monteiro, J., Trmal, J., & Bengio, Y. (2020). Multi-task self-supervised learning for robust speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6989–6993). IEEE
Shambhu, S. B., Gupta, M., & Agarwal, S. (2018). SVM based voice activity detection by fusing a new acoustic feature PLMS with some existing acoustic features of speech. Journal of Intelligent & Fuzzy Systems, 35(2), 901–908.
Google Scholar
Shaolong, H. (2016). Research on connected digital speech recognition based on HTK. Shanxi Electronic Technology, 05, 86–88.
Google Scholar
Sithara, A., Thomas, A., & Mathew, D. (2018). Study of MFCC and IHC feature extraction methods with probabilistic acoustic models for speaker biometric applications. Procedia Computer Science, 143, 648–662.
Google Scholar
Wang, Y., Mohamed, A., Le, D., Liu, C., Xiao, A., Mahadeokar, J., Huang, H., Tjandra, A., Zhang, X., Zhang, F., & Fuegen, C. (2020). Transformer-based acoustic modeling for hybrid speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6874–6878). IEEE
Wong, J. H. M., Gaur, Y., Zhao, R., Lu, L., Sun, E., Li, J., & Gong, Y. (2020). Combination of end-to-end and hybrid models for speech recognition. In Interspeech (pp. 1783–1787).
Wu, H., Wang, Y., & Huang, J. (2017). Identification of reconstructed speech. ACM Transactions on Multimedia Computing Communications and Applications, 13(1), 1–20.
Article Google Scholar
Xu, Z. (2016). A preliminary study on the teaching strategies of English vocabulary in senior high school. New curriculum, middle school (12)
Xuhong, Z. (2015). Study on the identification of risk based on voice emotion. Guangdong University of Technology
Yang, C.-H. H., Qi, J., Chen, S. Y.-C., Chen, P.-Y., Siniscalchi, S. M., Ma, X., & Lee, C.-H. (2021). Decentralizing feature extraction with quantum convolutional neural network for automatic speech recognition. In ICASSP 2021–2021 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6523–6527). IEEE
Yanzhan, Wu. (2016). Improved speech recognition system based on HMM and genetic neural network. Computer System Application, 01, 204–208.
Google Scholar
Yi, Du. (2020). Exploration of English education based on network environment. Chinese and Foreign Entrepreneurs, 678(16), 170–170.
Google Scholar
Zhenhua, L. (2017). The “plate” nature of English vocabulary and Its Enlightenment on English Teaching. Campus English, 000(001), 41–41.
Google Scholar
Zhenhua, P. (2019). Research on the application of conceptual metaphor in high school English Vocabulary Teaching. Campus English (26)

Download references

Funding

(1) The school-level teaching reform project of Chongqing University of Education "Research and Practice on the Training Model of Speaking and Writing Ability for Applied Undergraduate Business English Majors Based on the Hypothesis of "Output Drive-Input Promotion"" JD2017042. (2) Chongqing Educational Science "13th Five-Year Plan" General Project in 2018 "Action Research on the Core Competence and Professional Development Path of Undergraduate Business English Teachers under the Background of "One Belt One Road"-Taking Chongqing Universities as an Example", 2018-GX-333.

Author information

Authors and Affiliations

Chongqing University of Education, Chongqing, 400065, China
Lu Yang

Authors

Lu Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lu Yang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, L. HTK-based speech recognition and corpus-based English vocabulary online guiding system. Int J Speech Technol 25, 921–931 (2022). https://doi.org/10.1007/s10772-022-09968-7

Download citation

Received: 25 March 2021
Accepted: 14 March 2022
Published: 30 May 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s10772-022-09968-7

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A System of English Spoken Pronunciation Learning Based on Speech Recognition and Mobile Phone Platform

Automatic Speech Recognition System to Support Aural Vocabulary Learning

Dynamic out-of-vocabulary word registration to language model for speech recognition

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

HTK-based speech recognition and corpus-based English vocabulary online guiding system

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A System of English Spoken Pronunciation Learning Based on Speech Recognition and Mobile Phone Platform

Automatic Speech Recognition System to Support Aural Vocabulary Learning

Dynamic out-of-vocabulary word registration to language model for speech recognition

Explore related subjects

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation