Development and Evaluation of Julius-Compatible Interface for Kaldi ASR

Yusuke Yamada⁷,
Takashi Nose⁷,
Yuya Chiba⁷,
Akinori Ito⁷ &
…
Takahiro Shinozaki⁸

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 82))

Included in the following conference series:

International Conference on Intelligent Information Hiding and Multimedia Signal Processing

1192 Accesses

Abstract

In recent years, the use of Kaldi has rapidly grown because it has adopted various technologies of DNN-based speech recognition in succession and has shown high recognition performance. On the other hand, the speech recognition engine, Julius, has been widely used especially in Japan. Julius is also attracting attention since DNN-HMM is implemented in it. In this paper, we describe the design plan of interfaces that make Kaldi speech recognition engine be compatible with Julius, a system overview, and the details of the speech input unit and the recognition result output unit. We also refer to the functions that we are planning to implement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Review of Toolkit to Build Automatic Speech Recognition Models

Comparative Evaluation of Speech Recognition Systems Based on Different Toolkits

DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit

Article 20 May 2020

References

The Hidden Markov Model Toolkit (HTK), http://htk.eng.cam.ac.uk/
Glas, D.F., Minato, T., Ishi, C.T., Kawahara, T., Ishiguro, H.: Erica: the erato intelligent conversational android. In: Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), pp. 22–29 (2016)
Google Scholar
Ijima, Y., Nose, T., Tachibana, M., Kobayashi, T.: A rapid model adaptation technique for emotional speech recognition with style estimation based on multiple-regression HMM. IEICE Trans. Inf. Syst. 93(1), 107–115 (2010)
Article Google Scholar
Kawahara, T., Nanjo, H., Shinozaki, T., Furui, S.: Benchmark test for speech recognition using the corpus of spontaneous japanese. In: ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, pp. 1–4 (2003)
Google Scholar
Lee, A., Kawahara, T.: Recent development of open-source speech recognition engine julius. In: Proceedings of APSIPA ASC, pp. 131–137 (2009)
Google Scholar
Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Hannemann, M., Motlicek, P., Qian, Y., Schwarz, P., et al.: The kaldi speech recognition toolkit. In: Proceedings of IEEE Workshop on Automatic Speech Recognition And Understanding (ASRU) (2011)
Google Scholar
Zhang, X., Trmal, J., Povey, D., Khudanpur, S.: Improving deep neural network acoustic models using generalized maxout networks. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 215–219 (2014)
Google Scholar

Download references

Acknowledgment

Part of this work was supported by JSPS KAKENHI Grant Number JP26280055 and JP15H02720.

Author information

Authors and Affiliations

Graduate School of Engineering, Tohoku University, Aramaki Aza-Aoba 6–6–05, Aoba-ku, Sendai-shi, Miyagi, 980–8579, Japan
Yusuke Yamada, Takashi Nose, Yuya Chiba & Akinori Ito
Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama, 226-8502, Japan
Takahiro Shinozaki

Authors

Yusuke Yamada
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Nose
View author publications
You can also search for this author in PubMed Google Scholar
Yuya Chiba
View author publications
You can also search for this author in PubMed Google Scholar
Akinori Ito
View author publications
You can also search for this author in PubMed Google Scholar
Takahiro Shinozaki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yusuke Yamada .

Editor information

Editors and Affiliations

Fujian Provincial Key Lab of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, Fujian, China
Jeng-Shyang Pan
Swinburne University of Technology, Hawthorn, Victoria, Australia
Pei-Wei Tsai
Universiti Teknologi Petronas, Teronoh, Malaysia
Junzo Watada
University of Canberra, Bruce, Aust Capital Terr, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yamada, Y., Nose, T., Chiba, Y., Ito, A., Shinozaki, T. (2018). Development and Evaluation of Julius-Compatible Interface for Kaldi ASR. In: Pan, JS., Tsai, PW., Watada, J., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2017. Smart Innovation, Systems and Technologies, vol 82. Springer, Cham. https://doi.org/10.1007/978-3-319-63859-1_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-63859-1_12
Published: 18 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63858-4
Online ISBN: 978-3-319-63859-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Development and Evaluation of Julius-Compatible Interface for Kaldi ASR

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Review of Toolkit to Build Automatic Speech Recognition Models

Comparative Evaluation of Speech Recognition Systems Based on Different Toolkits

DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Development and Evaluation of Julius-Compatible Interface for Kaldi ASR

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Review of Toolkit to Build Automatic Speech Recognition Models

Comparative Evaluation of Speech Recognition Systems Based on Different Toolkits

DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation