Nothing Special   »   [go: up one dir, main page]

Skip to main content

Advertisement

Log in

HTK-based speech recognition and corpus-based English vocabulary online guiding system

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

With the popularization of computers and the development of modern educational technology, the connection between corpus and foreign language intelligent guiding is getting closer and closer. Corpus was first used in vocabulary guiding in foreign language guiding, and there are many research achievements in this field. However, in practical guiding, English vocabulary teaching is a big problem faced by teachers and students. This thesis mainly studies the English vocabulary online instruction system from the perspective of speech recognition. English vocabulary online guidance system has become an essential tool for English learners to learn vocabulary. Speech recognition technology is the technology that converts speech signals into text. Automatic speech recognition is also known as speech recognition or computer speech recognition, its goal is to let the computer can recognize the continuous speech that different people speak, to achieve the conversion of voice to text. Speech recognition is a comprehensive technology that integrates many subjects, including phonetics, linguistics, computer science and so on. Hence, this paper analyzes the HTK speech recognition technology and the construction of the corpus, and studies the English vocabulary online guidance system. The novel speech analysis technology is considered for the implementations of the novel guiding system. Through the comparison simulations compared with the other state-of-the-art systems, the designed outperformed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Asbai, N., & Amrouche, A. (2017). Boosting scores fusion approach using Front-End Diversity and adaboost Algorithm, for speaker verification. Computers and Electrical Engineering, 62, 250–257.

    Article  Google Scholar 

  • Boxin, J. (2016). The Enlightenment of cognitive linguistics on English Vocabulary Teaching. Overseas English, 320(04), 176–177.

    Google Scholar 

  • Chang, X., Zhang, W., Qian, Y., Roux, J. L., & Watanabe, S. (2020). End-to-end multi-speaker speech recognition with transformer. In ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6134–6138). IEEE

  • Chenchen, G., & Ying, W. (2016). Practical exploration of practical teaching materials construction of online English education. Scientific Guide, 000(009), 5–5.

    Google Scholar 

  • Haiyan, Z. (2017). Exploration of network foreign language teaching and education mode innovation. English Square, 079(07), 86–87.

    Google Scholar 

  • Hrinchuk, O., Popova, M., & Ginsburg, B. (2020). Correction of automatic speech recognition with transformer sequence-to-sequence model. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 7074–7078). IEEE

  • Jiying, M. (2017). Problems and countermeasures of middle school English Vocabulary Teaching under the concept of new curriculum. Journal of Prose

  • Li, M., Zorilă, C., & Doddipatla, R. (2021). Transformer-based online speech recognition with decoder-end adaptive computation steps. In 2021 IEEE spoken language technology workshop (SLT) (pp. 1–7). IEEE

  • Li, Z. (2016). Speech signal processing. China Machine Press

  • Moritz, N., Hori, T., & Roux, J. L. (2021). Semi-supervised speech recognition via graph-based temporal classification. In ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6548–6552). IEEE

  • Ng, E. H., Ning, & Rönnberg, J. (2020). Hearing aid experience and background noise affect the robust relationship between working memory and speech recognition in noise. International Journal of Audiology, 59(3), 208–218.

    Article  Google Scholar 

  • Ochiai, T., Watanabe, S., Hori, T., & Hershey, J. R. (2017). Multichannel end-to-end speech recognition. . In International conference on machine learning (pp. 2632–2641). PMLR

  • Okamoto, A. (2017). Speech recognition system, recognition dictionary registration system, and acoustic model identifier series generation apparatus.

  • Ravanelli, M., Zhong, J., Pascual, S., Swietojanski, P., Monteiro, J., Trmal, J., & Bengio, Y. (2020). Multi-task self-supervised learning for robust speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6989–6993). IEEE

  • Shambhu, S. B., Gupta, M., & Agarwal, S. (2018). SVM based voice activity detection by fusing a new acoustic feature PLMS with some existing acoustic features of speech. Journal of Intelligent & Fuzzy Systems, 35(2), 901–908.

    Google Scholar 

  • Shaolong, H. (2016). Research on connected digital speech recognition based on HTK. Shanxi Electronic Technology, 05, 86–88.

    Google Scholar 

  • Sithara, A., Thomas, A., & Mathew, D. (2018). Study of MFCC and IHC feature extraction methods with probabilistic acoustic models for speaker biometric applications. Procedia Computer Science, 143, 648–662.

    Google Scholar 

  • Wang, Y., Mohamed, A., Le, D., Liu, C., Xiao, A., Mahadeokar, J., Huang, H., Tjandra, A., Zhang, X., Zhang, F., & Fuegen, C. (2020). Transformer-based acoustic modeling for hybrid speech recognition. In ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6874–6878). IEEE

  • Wong, J. H. M., Gaur, Y., Zhao, R., Lu, L., Sun, E., Li, J., & Gong, Y. (2020). Combination of end-to-end and hybrid models for speech recognition. In Interspeech (pp. 1783–1787).

  • Wu, H., Wang, Y., & Huang, J. (2017). Identification of reconstructed speech. ACM Transactions on Multimedia Computing Communications and Applications, 13(1), 1–20.

    Article  Google Scholar 

  • Xu, Z. (2016). A preliminary study on the teaching strategies of English vocabulary in senior high school. New curriculum, middle school (12)

  • Xuhong, Z. (2015). Study on the identification of risk based on voice emotion. Guangdong University of Technology

  • Yang, C.-H. H., Qi, J., Chen, S. Y.-C., Chen, P.-Y., Siniscalchi, S. M., Ma, X., & Lee, C.-H. (2021). Decentralizing feature extraction with quantum convolutional neural network for automatic speech recognition. In ICASSP 2021–2021 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 6523–6527). IEEE

  • Yanzhan, Wu. (2016). Improved speech recognition system based on HMM and genetic neural network. Computer System Application, 01, 204–208.

    Google Scholar 

  • Yi, Du. (2020). Exploration of English education based on network environment. Chinese and Foreign Entrepreneurs, 678(16), 170–170.

    Google Scholar 

  • Zhenhua, L. (2017). The “plate” nature of English vocabulary and Its Enlightenment on English Teaching. Campus English, 000(001), 41–41.

    Google Scholar 

  • Zhenhua, P. (2019). Research on the application of conceptual metaphor in high school English Vocabulary Teaching. Campus English (26)

Download references

Funding

(1) The school-level teaching reform project of Chongqing University of Education "Research and Practice on the Training Model of Speaking and Writing Ability for Applied Undergraduate Business English Majors Based on the Hypothesis of "Output Drive-Input Promotion"" JD2017042. (2) Chongqing Educational Science "13th Five-Year Plan" General Project in 2018 "Action Research on the Core Competence and Professional Development Path of Undergraduate Business English Teachers under the Background of "One Belt One Road"-Taking Chongqing Universities as an Example", 2018-GX-333.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lu Yang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, L. HTK-based speech recognition and corpus-based English vocabulary online guiding system. Int J Speech Technol 25, 921–931 (2022). https://doi.org/10.1007/s10772-022-09968-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-022-09968-7

Keywords

Navigation