
ChitChatGuide: Conversational Interaction Using Large Language Models for Assisting People with Visual Impairments to Explore a Shopping Mall

Published: 24 September 2024

Abstract

To enable people with visual impairments (PVI) to explore shopping malls, it is important to support both selecting destinations and obtaining information that matches the individual's interests. We achieved this through conversational interaction by integrating a large language model (LLM) with a navigation system. ChitChatGuide allows users to plan a tour through contextual conversations, receive personalized descriptions of their surroundings tailored to the transit time, and make inquiries during navigation. We conducted a study in a shopping mall with 11 PVI, and the results reveal that the system allowed them to explore the facility with increased enjoyment. By understanding vague and context-dependent questions, the LLM-based conversational interaction enabled the participants to explore unfamiliar environments effectively. The personalized, in-situ information generated by the LLM was both useful and enjoyable. Considering the limitations we identified, we discuss criteria for integrating LLMs into navigation systems to enhance the exploration experiences of PVI.
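
To make the integration concrete, below is a minimal sketch, in Python, of one piece the abstract describes: generating a POI description whose length fits the transit time to the destination. All identifiers and constants here (describe_poi, WALK_SPEED_MPS, SPEECH_RATE_WPS) are illustrative assumptions, not the authors' implementation; the sketch assumes the OpenAI Python client and the GPT-4 model referenced in the supplemental material.

from openai import OpenAI

client = OpenAI()

# Assumed constants for illustration, not values from the paper.
WALK_SPEED_MPS = 0.8    # average walking speed during guided navigation
SPEECH_RATE_WPS = 2.5   # text-to-speech rate in words per second

def describe_poi(poi_name: str, poi_facts: str, distance_m: float,
                 user_interests: str) -> str:
    """Request a POI description sized to finish before the user arrives."""
    transit_s = distance_m / WALK_SPEED_MPS          # estimated walking time
    word_budget = int(transit_s * SPEECH_RATE_WPS)   # words that fit that time
    prompt = (
        f"You are a sighted guide in a shopping mall assisting a visitor "
        f"with a visual impairment who is interested in {user_interests}. "
        f"Describe '{poi_name}' in at most {word_budget} words.\n"
        f"Known facts about this place:\n{poi_facts}"
    )
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

Scaling the word budget to the estimated walking time is the design choice that keeps a spoken description from outlasting the walk to the destination.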

Supplemental Material

ZIP File: Supplemental video.
ZIP File: A folder containing two files, "README.txt" and "ChitChatGuide_CameraReady_Appendix.pdf". The "README.txt" file explains how the folder is organized. The appendix contains examples of prompts and responses from the GPT-4 model, a conversation example from tour planning, and examples of generated POI descriptions.


      Published In

Proceedings of the ACM on Human-Computer Interaction, Volume 8, Issue MHCI
September 2024
1136 pages
EISSN: 2573-0142
DOI: 10.1145/3697825
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 September 2024
      Published in PACMHCI Volume 8, Issue MHCI


      Author Tags

      1. large language model
      2. orientation and mobility
      3. visual impairment

      Qualifiers

      • Research-article

      Funding Sources

      • JSPS KAKENHI
