Abstract
Conversational interfaces have recently become ubiquitous in the personal sphere by improving an individual’s quality of life and industrial environments by automating services and their corresponding cost savings. However, designing the dialog model used by these interfaces to decide the following response is a hard-to-accomplish task for complex conversational interactions. This paper proposes a statistical-based dialog manager architecture, which provides flexibility to develop and maintain this module. Our proposal has been integrated using DialogFlow, a natural language understanding platform provided by Google to design conversational user interfaces. The proposed hybrid architecture has been assessed with a real use case for a train scheduling domain, proving that the user experience is highly valued and can be integrated into commercial setups.
The research leading to these results has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement no. 823907 (MENHIR project: https://menhir-project.eu) and the projects supported by the Spanish Ministry of Science and Innovation through the GOMINOLA (PID2020-118112RB-C21 and PID2020-118112RB-C22, funded by MCIN/AEI/10.13039/501100011033), and CAVIAR (TEC2017-84593-C2-1-R, funded by MCIN/AEI/10.13039/501100011033/FEDER).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Canas, P., Griol, D., Callejas, Z.: Towards versatile conversations with data-driven dialog management and its integration in commercial platforms. J. Comput. Sci. 55, 101443 (2021)
Dutoit, T.: An Introduction to Text-to-Speech Synthesis. Kluwer Academic Publishers, New York (1996)
Griol, D., Hurtado, L., Segarra, E., Sanchis, E.: A domain-independent statistical methodology for dialog management in spoken dialog systems. Comput. Speech Lang. 28(3), 743–768 (2014)
Kumar, N., Raubal, M.: Applications of deep learning in congestion detection, prediction and alleviation: a survey. Transp. Res. 133, 103432 (2021)
Lemon, O.: Learning what to say and how to say it: joint optimisation of spoken dialogue management and natural language generation. Comput. Speech Lang. 25, 210–221 (2011)
Levin, E., Pieraccini, R., Eckert, W.: Using Markov decision process for learning dialogue strategies. In: Proceedings of ICASSP 1998. vol. 1, pp. 201–204. Seattle, WA, USA (1998)
Levin, E., Pieraccini, R., Eckert, W.: A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans. Speech Audio Process. 8(1), 11–23 (2000)
Li, W., et al.: A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: theories, applications and challenges. Mech. Syst. Signal Process. 167, 108487 (2022)
Matějů, L., Griol, D., Callejas, Z., Molina, J.M., Sanchis, A.: An empirical assessment of deep learning approaches to task-oriented dialog management. Neurocomputing 439, 327–339 (2021)
McTear, M.: Conversational AI. Dialogue Systems, Conversational Agents, and Chatbots. Morgan and Claypool Publishers (2020)
McTear, M., Callejas, Z., Griol, D.: The Conversational Interface. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-32967-3
O’Shaughnessy, D.: Invited paper: automatic speech recognition: history, methods and challenges. Pattern Recogn. 41, 2965–2979 (2008)
Rejaibi, E., Komaty, A., Meriaudeau, F., Agrebi, S., Othmani, A.: MFCC-based recurrent neural network for automatic clinical depression recognition and assessment from speech. Biomed. Signal Process. Control 71, 103107 (2022)
Traum, D.R., Larsson, S.: The information state approach to dialogue management. In: van Kuppevelt, J., Smith, R.W. (eds.) Current and New Directions in Discourse and Dialogue. Text, Speech and Language Technology, vol. 22, pp. 325–353. Springer, Netherlands (2003). https://doi.org/10.1007/978-94-010-0019-2_15
Tsilfidis, A., Mporas, I., Mourjopoulos, J., Fakotakis, N.: Automatic speech recognition performance in different room acoustic environments with and without dereverberation preprocessing. Comput. Speech Lang. 27, 380–395 (2013)
Wu, W.L., Lu, R.Z., Duan, J.Y., Liu, H., Gao, F., Chen, Y.Q.: Spoken language understanding using weakly supervised learning. Comput. Speech Lang. 24, 358–382 (2010)
Xue, J., Zheng, T., Han, J.: Exploring attention mechanisms based on summary information for end-to-end automatic speech recognition. Neurocomputing 465, 514–524 (2021)
Young, S.: Hey Cyba. The Inner Workings of a Virtual Personal Assistant, Cambridge University Press, Cambridge (2021)
Young, S., et al.: The hidden information state model: a practical framework for POMDP-based spoken dialogue management. Comput. Speech Lang. 24, 150–174 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Cañas, P., Griol, D., Callejas, Z. (2022). A Proposal for Developing and Deploying Statistical Dialog Management in Commercial Conversational Platforms. In: García Bringas, P., et al. Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469. Springer, Cham. https://doi.org/10.1007/978-3-031-15471-3_35
Download citation
DOI: https://doi.org/10.1007/978-3-031-15471-3_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15470-6
Online ISBN: 978-3-031-15471-3
eBook Packages: Computer ScienceComputer Science (R0)