Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3631700.3664902acmconferencesArticle/Chapter ViewAbstractPublication PagesumapConference Proceedingsconference-collections
research-article

Using Large Language Models for Adaptive Dialogue Management in Digital Telephone Assistants

Published: 28 June 2024 Publication History

Abstract

The advent of modern information technology such as Large Language Models (LLMs) allows for massively simplifying and streamlining the communication processes in human-machine interfaces. In the specific domain of healthcare, and for patient practice interaction in particular, user acceptance of automated voice assistants remains a challenge to be solved. We explore approaches to increase user satisfaction by language model based adaptation of user-directed utterances. The presented study considers parameters such as gender, age group, and sentiment for adaptation purposes. Different LLMs and open-source models are evaluated for their effectiveness in this task. The models are compared, and their performance is assessed based on speed, cost, and the quality of the generated text, with the goal of selecting an ideal model for utterance adaptation. We find that carefully designed prompts and a well-chosen set of evaluation metrics, which balance the relevancy and adequacy of adapted utterances, are crucial for optimizing user satisfaction in conversational artificial intelligence systems successfully. Importantly, our research demonstrates that the GPT-3.5-turbo model currently provides the most balanced performance in terms of adaptation relevancy and adequacy, underscoring its suitability for scenarios that demand high adherence to the information in the original utterances, as required in our case.

References

[1]
Urmil Bharti, Deepali Bajaj, Hunar Batra, Shreya Lalit, Shweta Lalit, and Aayushi Gangwani. 2020. Medbot: Conversational artificial intelligence powered chatbot for delivering tele-health after covid-19. In 2020 5th international conference on communication and electronics systems (ICCES). IEEE, 870–875.
[2]
Sarah E Finch and Jinho D Choi. 2020. Towards unified dialogue system evaluation: A comprehensive analysis of current evaluation protocols. arXiv preprint arXiv:2006.06110 (2020).
[3]
Jia-Chen Gu, Hui Liu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, and Xiaodan Zhu. 2021. Partner matters! an empirical study on fusing personas for personalized response selection in retrieval-based chatbots. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 565–574.
[4]
Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt. 2021. Measuring Massive Multitask Language Understanding. arxiv:2009.03300 [cs.CY]
[5]
Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, and Aldo Lipani. 2023. Unlocking the potential of user feedback: Leveraging large language model as user simulators to enhance dialogue system. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 3953–3957.
[6]
Jessica Huynh, Cathy Jiao, Prakhar Gupta, Shikib Mehri, Payal Bajaj, Vishrav Chaudhary, and Maxine Eskenazi. 2023. Understanding the effectiveness of very large language models on dialog evaluation. arXiv preprint arXiv:2301.12004 (2023).
[7]
Margaret Li, Jason Weston, and Stephen Roller. 2019. Acute-eval: Improved dialogue evaluation with optimized questions and multi-turn comparisons. arXiv preprint arXiv:1909.03087 (2019).
[8]
Christopher Manning and Hinrich Schutze. 1999. Foundations of statistical natural language processing. MIT press.
[9]
Nourchène Ouerhani, Ahmed Maalel, and Henda Ben Ghézela. 2020. SPeCECA: a smart pervasive chatbot for emergency case assistance based on cloud computing. Cluster Computing 23 (2020), 2471–2482.
[10]
Xinmeng Song and Ting Xiong. 2021. A survey of published literature on conversational artificial intelligence. In 2021 7th International conference on information management (ICIM). IEEE, 113–117.
[11]
David Suendermann, Jackson Liscombe, and Roberto Pieraccini. 2010. Optimize the obvious: Automatic call flow generation. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 5370–5373.
[12]
Bernhard Suhm and Pat Peterson. 2002. A data-driven methodology for evaluating and optimizing call center IVRs. International Journal of Speech Technology 5 (2002), 23–37.
[13]
Marko Tkalčič, Berardina De Carolis, Marco De Gemmis, Ante Odić, and Andrej Košir. 2016. Emotions and personality in personalized services. Human-Computer Interaction Series. Elsevier (2016).
[14]
Stefano Valtolina, Barbara Rita Barricelli, and Serena Di Gaetano. 2020. Communicability of traditional interfaces VS chatbots in healthcare and smart home domains. Behaviour & Information Technology 39, 1 (2020), 108–132.
[15]
Vanessa Voelskow, Claudia Meßner, Tobias Kurth, Amelie Busam, Toivo Glatz, and Natalie Ebert. 2023. Prospective mixed-methods study evaluating the potential of a voicebot (CovBot) to relieve German health authorities during the COVID-19 infodemic. Digital Health 9 (2023), 20552076231180677.
[16]
Chen Zhang, Luis Fernando D’Haro, Yiming Chen, Malu Zhang, and Haizhou Li. 2024. A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators. Proceedings of the AAAI Conference on Artificial Intelligence 38, 17 (Mar. 2024), 19515–19524. https://doi.org/10.1609/aaai.v38i17.29923

Cited By

View all
  • (2024)HAAPIE 2024: 9th International Workshop on Human Aspects in Adaptive and Personalized Interactive EnvironmentsAdjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization10.1145/3631700.3658526(362-364)Online publication date: 27-Jun-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
UMAP Adjunct '24: Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization
June 2024
662 pages
ISBN:9798400704666
DOI:10.1145/3631700
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 June 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Adaptive Dialogue Systems
  2. Digital Telephone Assistants
  3. Large Language Models

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • German Federal Ministry of Education and Research (BMBF)

Conference

UMAP '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 162 of 633 submissions, 26%

Upcoming Conference

UMAP '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)104
  • Downloads (Last 6 weeks)17
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)HAAPIE 2024: 9th International Workshop on Human Aspects in Adaptive and Personalized Interactive EnvironmentsAdjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization10.1145/3631700.3658526(362-364)Online publication date: 27-Jun-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media