Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3173574.3174214acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article

Voice Interfaces in Everyday Life

Published: 21 April 2018 Publication History

Abstract

Voice User Interfaces (VUIs) are becoming ubiquitously available, being embedded both into everyday mobility via smartphones, and into the life of the home via 'assistant' devices. Yet, exactly how users of such devices practically thread that use into their everyday social interactions remains underexplored. By collecting and studying audio data from month-long deployments of the Amazon Echo in participants' homes-informed by ethnomethodology and conversation analysis-our study documents the methodical practices of VUI users, and how that use is accomplished in the complex social life of the home. Data we present shows how the device is made accountable to and embedded into conversational settings like family dinners where various simultaneous activities are being achieved. We discuss how the VUI is finely coordinated with the sequential organisation of talk. Finally, we locate implications for the accountability of VUI interaction, request and response design, and raise conceptual challenges to the notion of designing 'conversational' interfaces.

Supplementary Material

MP4 File (pn4837.mp4)

References

[1]
J. Maxwell Atkinson and John Heritage. 1984. Transcript Notation. In Structures of Social Action: Studies in Conversation Analysis. Cambridge University Press, ix--xvi.
[2]
Liam Bannon, John Bowers, Peter Carstensen, John A. Hughes, Kari Kuutii, James Pycock, Tom Rodden, Kjeld Schmidt, Dan Shapiro, Wes Sharrock, and Stephen Viller. 1993. Informing CSCW System Requirements. In COMIC Deliverable 2.1.
[3]
Graham Button, Jeff Coulter, John R. E. Lee, and Wes Sharrock. 1995. Computers, Minds and Conduct. Polity Press, Cambridge, UK.
[4]
Andy Crabtree, Steve Benford, Chris Greenhalgh, Paul Tennent, Matthew Chalmers, and Barry Brown. 2006. Supporting Ethnographic Studies of Ubiquitous Computing in the Wild. In Proceedings of the 6th ACM Conference on Designing Interactive Systems (DIS '06), 60.
[5]
David DeVault, Ron Artstein, Grace Benn, Teresa Dey, Ed Fast, Alesia Gainer, Kallirroi Georgila, Jon Gratch, Arno Hartholt, Margaux Lhommet, Gale Lucas, Stacy Marsella, Fabrizio Morbini, Angela Nazarian, Stefan Scherer, Giota Stratou, Apar Suri, David Traum, Rachel Wood, Yuyu Xu, Albert Rizzo, and Louisphilippe Morency. 2014. SimSensei Kiosk: A Virtual Human Interviewer for Healthcare Decision Support. International Conference on Autonomous Agents and Multi-Agent Systems, 1: 1061--1068.
[6]
Paul Dourish and Graham Button. 1998. On "Technomethodology": Foundational Relationships Between Ethnomethodology and System Design. Human-Computer Interaction 13, 4: 395--432.
[7]
Hasan Shahid Ferdous, Frank Vetere, Hilary Davis, Bernd Ploderer, and Kenton OHara. 2016. Technologies At Mealtime: Collocated Interactions In The Family Home. In CHI '16 Workshop on Proxemic Mobile Collocated Interactions.
[8]
Harold Garfinkel. 1967. Studies in Ethnomethodology. Prentice-Hall.
[9]
Nigel Gilbert, Robin Wooffitt, and Norman Fraser. 1990. Organising Computer Talk. In Computers and Conversation (1st edition), Paul Luff, David Frohlich and Nigel Gilbert (eds.). Academic Press, 235 -- 257.
[10]
Charles Goodwin and John Heritage. 1990. Conversation Analysis. Annual Review of Anthropology 19, 1: 283--307.
[11]
Christian Heath, Jon Hindmarsh, and Paul Luff. 2010. Video in Qualitative Research. SAGE.
[12]
Jiepu Jiang, Wei Jeng, and Daqing He. 2013. How Do Users Respond to Voice Input Errors?: Lexical and Phonetic Query Reformulation in Voice Search. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '13), 143--152.
[13]
Mohammed Waleed Kadous and Claude Sammut. 2004. InCa: A Mobile Conversational Agent. PRICAI 2004: Trends in Artificial Intelligence 3157: 644--653.
[14]
Stefan Kopp, Lars Gesellensetter, Nicole C. Krämer, and Ipke Wachsmuth. 2005. A Conversational Agent as Museum Guide -- Design and Evaluation of a RealWorld Application. In Lecture Notes in Computer Science, 329--343.
[15]
Stephen C. Levinson. 1983. Pragmatics. Cambridge University Press.
[16]
J. C. R. Licklider. 1960. Man-Computer Symbiosis. IRE Transactions on Human Factors in Electronics HFE-1, 1: 4--11.
[17]
Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 5286--5297.
[18]
Moira McGregor and John Tang. 2017. More to Meetings: Challenges in Using Speech-Based Technology to Support Meetings. In Proceedings of the 20th ACM Conference on Computer-Supported Cooperative Work & Social Computing (CSCW '17).
[19]
Michael McTear. 2002. Spoken Dialogue Technology: Enabling the Conversational User Interface. ACM Computing Surveys 34, 1: 90--169.
[20]
Michael McTear, Zoraida Callejas, and David Griol. 2016. The Conversational Interface. Springer International Publishing.
[21]
Dong Nguyen, A. Seza Doğruöz, Carolyn P. Rosé, and Franciska de Jong. 2016. Computational Sociolinguistics: A Survey. Computational Linguistics 42, 3: 537--593.
[22]
Kenton O'Hara, Richard Harper, Helena Mentis, Abigail Sellen, and Alex Taylor. 2013. On the Naturalness of Touchless: Putting the "Interaction" Back into NUI. ACM Transactions on ComputerHuman Interaction 20, 1: 1--25.
[23]
Sabine Payr. 2013. Virtual butlers and real people: styles and practices in long-term use of a companion. In Your Virtual Butler, Robert Trappl (ed.). SpringerVerlag Berlin, Heidelberg, 134--178.
[24]
Hannah R. M. Pelikan and Mathias Broth. 2016. Why That Nao?: How Humans Adapt to a Conventional Humanoid Robot in Taking Turns-at-Talk. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 4921--4932.
[25]
Stefania Pizza, Barry Brown, Donald McMillan, and Airi Lampinen. 2016. Smartwatch in vivo. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 5456--5469.
[26]
Martin Porcheron, Joel E. Fischer, Moira McGregor, Barry Brown, Ewa Luger, Heloisa Candello, and Kenton O'Hara. 2017. Talking with Conversational Agents in Collaborative Action. In Companion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17 Companion), 431--436.
[27]
Martin Porcheron, Joel E. Fischer, and Sarah Sharples. 2017. "Do Animals Have Accents?": Talking with Agents in Multi-Party Conversation. In Proceedings of the 20th ACM Conference on Computer-Supported Cooperative Work & Social Computing (CSCW '17).
[28]
Stuart Reeves and Barry Brown. 2016. Embeddedness and Sequentiality in Social Media. In Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing (CSCW '16), 1050--1062.
[29]
Jacob M. Rigby, Duncan P. Brumby, Sandy J. J. Gould, and Anna L Cox. 2017. Media Multitasking at Home. In Proceedings of the 2017 ACM International Conference on Interactive Experiences for TV and Online Video (TVX '17), 3--10.
[30]
Sean Rintel, Richard Harper, and Kenton O'Hara. 2016. The Tyranny of the Everyday in Mobile Video Messaging. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). ACM, New York, NY, USA, 47814792.
[31]
John Rooksby, Timothy E. Smith, Alistair Morrison, Mattias Rost, and Matthew Chalmers. 2015. Configuring Attention in the Multiscreen Living Room. In Proceedings of the 14th European Conference on Computer Supported Cooperative Work (ECSCW '15), 243--261.
[32]
Harvey Sacks. 1992. Harvey Sacks: Lectures on Conversation. Basil Publishing, Oxford.
[33]
Harvey Sacks, Emanuel A. Schegloff, and Gail Jefferson. 1974. A Simplest Systematics for the Organization of Turn-Taking for Conversation. Language 50, 4: 696--735.
[34]
Emanuel A. Schegloff. 1987. Analyzing Single Episodes of Interaction: An Exercise in Conversation Analysis. Social Psychology Quarterly 50, 2: 101--114.
[35]
Emanuel A. Schegloff. 2007. Sequence Organization in Interaction. Cambridge University Press, Cambridge.
[36]
Tanya Stivers, N. J. Enfield, Penelope Brown, Christina Englert, Makoto Hayashi, Trine Heinemann, Gertie Hoymann, Federico Rossano, Jan Peter de Ruiter, Kyung-Eun Yoon, and Stephen C Levinson. 2009. Universals and cultural variation in turn-taking in conversation. Proceedings of the National Academy of Sciences of the United States of America 106, 26: 10587--92.
[37]
Peter Tolmie and Andy Crabtree. 2008. Deploying Research Technology in the Home. In Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work (CSCW '08), 639--648.
[38]
Peter Tolmie, Andy Crabtree, Tom Rodden, and Steve Benford. 2008. "Are You Watching This Film or What?" Interruption and the Juggling of Cohorts. In Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work (CSCW '08), 257.
[39]
Sherry Turkle. 2011. Alone Together: Why We Expect More from Technology and Less from Each Other. Basic Books.
[40]
Laura Pfeifer Vardoulakis, Lazlo Ring, Barbara Barry, Candace L. Sidner, and Timothy Bickmore. 2012. Designing Relational Agents as Long Term Social Companions for Older Adults. In Intelligent Virtual Agents. 289--302.
[41]
Robin Wooffitt. 1994. Applying Sociology: Conversation Analysis in the Study of Human(Simulated) Computer Interaction. Bulletin de Méthodologie Sociologique 43, 1: 7--33.
[42]
Victor Zue, Stephanie Seneff, J. R. Glass, Joseph Polifroni, Christine Pao, T. J. Hazen, and Lee Hetherington. 2000. JUPlTER: a telephone-based conversational interface for weather information. IEEE Transactions on Speech and Audio Processing 8, 1: 85--96.

Cited By

View all
  • (2024)Beyond Binary Dialogues: Research and Development of a Linguistically Nuanced Conversation Design for Social Robots in Group–Robot InteractionsApplied Sciences10.3390/app14221031614:22(10316)Online publication date: 9-Nov-2024
  • (2024)Gender and Accent Biases in AI-Based Tools for Spanish: A Comparative Study between Alexa and WhisperApplied Sciences10.3390/app1411473414:11(4734)Online publication date: 30-May-2024
  • (2024)You have interrupted me again!: making voice assistants more dementia-friendly with incremental clarificationFrontiers in Dementia10.3389/frdem.2024.13430523Online publication date: 12-Mar-2024
  • Show More Cited By

Index Terms

  1. Voice Interfaces in Everyday Life

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
    April 2018
    8489 pages
    ISBN:9781450356206
    DOI:10.1145/3173574
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 April 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Badges

    • Best Paper

    Author Tags

    1. amazon echo
    2. collocated interaction
    3. conversation analysis
    4. conversational agent
    5. conversational user interface
    6. ethnomethodology
    7. intelligent personal assistants

    Qualifiers

    • Research-article

    Funding Sources

    • EPSRC

    Conference

    CHI '18
    Sponsor:

    Acceptance Rates

    CHI '18 Paper Acceptance Rate 666 of 2,590 submissions, 26%;
    Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

    Upcoming Conference

    CHI '25
    CHI Conference on Human Factors in Computing Systems
    April 26 - May 1, 2025
    Yokohama , Japan

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)651
    • Downloads (Last 6 weeks)83
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Beyond Binary Dialogues: Research and Development of a Linguistically Nuanced Conversation Design for Social Robots in Group–Robot InteractionsApplied Sciences10.3390/app14221031614:22(10316)Online publication date: 9-Nov-2024
    • (2024)Gender and Accent Biases in AI-Based Tools for Spanish: A Comparative Study between Alexa and WhisperApplied Sciences10.3390/app1411473414:11(4734)Online publication date: 30-May-2024
    • (2024)You have interrupted me again!: making voice assistants more dementia-friendly with incremental clarificationFrontiers in Dementia10.3389/frdem.2024.13430523Online publication date: 12-Mar-2024
    • (2024)How a Child Learns to ‘Talk’ to a Smart Speaker: On the Emergence of Enlanguaged PracticesLinguistic Frontiers10.2478/lf-2024-00107:1(1-22)Online publication date: 5-Jul-2024
    • (2024)“You are Apple, why are you speaking to me in Turkish?”: the role of English in voice assistant interactionsMultilingua10.1515/multi-2023-007243:4(455-485)Online publication date: 27-Feb-2024
    • (2024)The Effects of Human-Robot Interactions and the Human-Robot Relationship on Robot Competence, Trust, and AcceptanceSage Open10.1177/2158244024124823014:2Online publication date: 17-May-2024
    • (2024)Interactive probes: Towards action-level evaluation for dialogue systemsDiscourse & Communication10.1177/17504813241267071Online publication date: 12-Sep-2024
    • (2024)User practices in dealing with trouble in interactions with virtual assistants in German: Repeating, altering and insistingDiscourse & Communication10.1177/175048132411271494Online publication date: 28-Aug-2024
    • (2024)The disciplined customer: A video-based study of automated self-service hotelsNew Media & Society10.1177/1461444824125179326:9(5013-5038)Online publication date: 30-Aug-2024
    • (2024)The Importance of Timing—An Expert Evaluation on Latencies for Voice AssistantsProceedings of the Human Factors and Ergonomics Society Annual Meeting10.1177/10711813241260290Online publication date: 10-Aug-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media