Abstract
This article describes knowledge-based spoken dialogue system design from scratch. It covers all stages which were performed during the period of three weeks: definition of semantic goals and entities, data collection and recording of sample dialogues, data annotation, parser and grammars design, dialogue manager design and testing. The work was focused mainly on rapid development of such a dialogue system. The final implementation was written in dynamically generated VoiceXML. The large vocabulary continuous speech recognition system was used and the language understanding module was implemented using non-recursive probabilistic context free grammars which were converted to finite states transducers. The design and implementation has been verified on a railway information service task with a real large-scale database. The paper describes an innovative combination of data, expert knowledge and state-of-the-art methods which allow fast spoken dialogue system design.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
McTear, M.: Using the CSLU Toolkit for Practicals in Spoken Dialogue Technology. In: Proceedings of ESCA/SOCRATES Workshop on Method and Tool Innovations for Speech Science Education (1999)
Bringert, B.: Rapid Development of Dialogue Systems by Grammar Compilation. In: Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, Antwerp, Belgium, pp. 223–226 (2007)
Akolkar, R.P., Faruquie, T.A., Huerta, J., Kankar, P., Rajput, N., Raman, T.V., Udupa, R.U., Verma, A.: Reusable Dialog Component Framework for Rapid Voice Application Development. In: Heineman, G.T., Crnković, I., Schmidt, H.W., Stafford, J.A., Ren, X.-M., Wallnau, K. (eds.) CBSE 2005. LNCS, vol. 3489, pp. 306–321. Springer, Heidelberg (2005)
Sonntag, D., Sonnenberg, G., Nesselrath, R., Herzog, G.: Supporting a Rapid Dialogue System Engineering Process. In: Proceedings of IWSDS, Kloster Irsee, Germany (2009)
Šmídl, L., Valenta, T.: WebTransc – Software, Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia, Pilsen (2010), http://www.kky.zcu.cz/en/sw/wt
Pražák, A., Müller, L., Psutka Josef, V., Psutka, J.: Live TV Subtitling – Fast 2-pass LVCSR System for Online Subtitling. In: Proceedings of SIGMAP 2007, pp. 139–142. INSTICC PRESS, Lisabon (2007)
Švec, J., Šmídl, L.: Real-time Large Vocabulary Spontaneous Speech Recognition for Spoken Dialogue Systems. In: Proceedings of the 4th Int. Cong. on Image and Signal Processing, Shanghai, pp. 2458–2463 (2011)
Tihelka, D., Kala, J., Matoušek, J.: Enhancements of Viterbi Search for Fast Unit Selection Synthesis. In: Proceedings of Int. Conf. Interspeech 2010, pp. 174–177 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Valenta, T., Švec, J., Šmídl, L. (2012). Spoken Dialogue System Design in 3 Weeks. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2012. Lecture Notes in Computer Science(), vol 7499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32790-2_76
Download citation
DOI: https://doi.org/10.1007/978-3-642-32790-2_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32789-6
Online ISBN: 978-3-642-32790-2
eBook Packages: Computer ScienceComputer Science (R0)