Abstract
With the popularization of computers and the development of modern educational technology, the connection between corpus and foreign language intelligent guiding is getting closer and closer. Corpus was first used in vocabulary guiding in foreign language guiding, and there are many research achievements in this field. However, in practical guiding, English vocabulary teaching is a big problem faced by teachers and students. This thesis mainly studies the English vocabulary online instruction system from the perspective of speech recognition. English vocabulary online guidance system has become an essential tool for English learners to learn vocabulary. Speech recognition technology is the technology that converts speech signals into text. Automatic speech recognition is also known as speech recognition or computer speech recognition, its goal is to let the computer can recognize the continuous speech that different people speak, to achieve the conversion of voice to text. Speech recognition is a comprehensive technology that integrates many subjects, including phonetics, linguistics, computer science and so on. Hence, this paper analyzes the HTK speech recognition technology and the construction of the corpus, and studies the English vocabulary online guidance system. The novel speech analysis technology is considered for the implementations of the novel guiding system. Through the comparison simulations compared with the other state-of-the-art systems, the designed outperformed.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Asbai, N., & Amrouche, A. (2017). Boosting scores fusion approach using Front-End Diversity and adaboost Algorithm, for speaker verification. Computers and Electrical Engineering, 62, 250–257.
Boxin, J. (2016). The Enlightenment of cognitive linguistics on English Vocabulary Teaching. Overseas English, 320(04), 176–177.
Chang, X., Zhang, W., Qian, Y., Roux, J. L., & Watanabe, S. (2020). End-to-end multi-speaker speech recognition with transformer. In ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6134–6138). IEEE
Chenchen, G., & Ying, W. (2016). Practical exploration of practical teaching materials construction of online English education. Scientific Guide, 000(009), 5–5.
Haiyan, Z. (2017). Exploration of network foreign language teaching and education mode innovation. English Square, 079(07), 86–87.
Hrinchuk, O., Popova, M., & Ginsburg, B. (2020). Correction of automatic speech recognition with transformer sequence-to-sequence model. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 7074–7078). IEEE
Jiying, M. (2017). Problems and countermeasures of middle school English Vocabulary Teaching under the concept of new curriculum. Journal of Prose
Li, M., Zorilă, C., & Doddipatla, R. (2021). Transformer-based online speech recognition with decoder-end adaptive computation steps. In 2021 IEEE spoken language technology workshop (SLT) (pp. 1–7). IEEE
Li, Z. (2016). Speech signal processing. China Machine Press
Moritz, N., Hori, T., & Roux, J. L. (2021). Semi-supervised speech recognition via graph-based temporal classification. In ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6548–6552). IEEE
Ng, E. H., Ning, & Rönnberg, J. (2020). Hearing aid experience and background noise affect the robust relationship between working memory and speech recognition in noise. International Journal of Audiology, 59(3), 208–218.
Ochiai, T., Watanabe, S., Hori, T., & Hershey, J. R. (2017). Multichannel end-to-end speech recognition. . In International conference on machine learning (pp. 2632–2641). PMLR
Okamoto, A. (2017). Speech recognition system, recognition dictionary registration system, and acoustic model identifier series generation apparatus.
Ravanelli, M., Zhong, J., Pascual, S., Swietojanski, P., Monteiro, J., Trmal, J., & Bengio, Y. (2020). Multi-task self-supervised learning for robust speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6989–6993). IEEE
Shambhu, S. B., Gupta, M., & Agarwal, S. (2018). SVM based voice activity detection by fusing a new acoustic feature PLMS with some existing acoustic features of speech. Journal of Intelligent & Fuzzy Systems, 35(2), 901–908.
Shaolong, H. (2016). Research on connected digital speech recognition based on HTK. Shanxi Electronic Technology, 05, 86–88.
Sithara, A., Thomas, A., & Mathew, D. (2018). Study of MFCC and IHC feature extraction methods with probabilistic acoustic models for speaker biometric applications. Procedia Computer Science, 143, 648–662.
Wang, Y., Mohamed, A., Le, D., Liu, C., Xiao, A., Mahadeokar, J., Huang, H., Tjandra, A., Zhang, X., Zhang, F., & Fuegen, C. (2020). Transformer-based acoustic modeling for hybrid speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6874–6878). IEEE
Wong, J. H. M., Gaur, Y., Zhao, R., Lu, L., Sun, E., Li, J., & Gong, Y. (2020). Combination of end-to-end and hybrid models for speech recognition. In Interspeech (pp. 1783–1787).
Wu, H., Wang, Y., & Huang, J. (2017). Identification of reconstructed speech. ACM Transactions on Multimedia Computing Communications and Applications, 13(1), 1–20.
Xu, Z. (2016). A preliminary study on the teaching strategies of English vocabulary in senior high school. New curriculum, middle school (12)
Xuhong, Z. (2015). Study on the identification of risk based on voice emotion. Guangdong University of Technology
Yang, C.-H. H., Qi, J., Chen, S. Y.-C., Chen, P.-Y., Siniscalchi, S. M., Ma, X., & Lee, C.-H. (2021). Decentralizing feature extraction with quantum convolutional neural network for automatic speech recognition. In ICASSP 2021–2021 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6523–6527). IEEE
Yanzhan, Wu. (2016). Improved speech recognition system based on HMM and genetic neural network. Computer System Application, 01, 204–208.
Yi, Du. (2020). Exploration of English education based on network environment. Chinese and Foreign Entrepreneurs, 678(16), 170–170.
Zhenhua, L. (2017). The “plate” nature of English vocabulary and Its Enlightenment on English Teaching. Campus English, 000(001), 41–41.
Zhenhua, P. (2019). Research on the application of conceptual metaphor in high school English Vocabulary Teaching. Campus English (26)
Funding
(1) The school-level teaching reform project of Chongqing University of Education "Research and Practice on the Training Model of Speaking and Writing Ability for Applied Undergraduate Business English Majors Based on the Hypothesis of "Output Drive-Input Promotion"" JD2017042. (2) Chongqing Educational Science "13th Five-Year Plan" General Project in 2018 "Action Research on the Core Competence and Professional Development Path of Undergraduate Business English Teachers under the Background of "One Belt One Road"-Taking Chongqing Universities as an Example", 2018-GX-333.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, L. HTK-based speech recognition and corpus-based English vocabulary online guiding system. Int J Speech Technol 25, 921–931 (2022). https://doi.org/10.1007/s10772-022-09968-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-022-09968-7