DOI: 10.1145/3313831.3376310

An Honest Conversation: Transparently Combining Machine and Human Speech Assistance in Public Spaces

Published: 23 April 2020

Abstract

There is widespread concern over the ways speech assistant providers currently use humans to listen to users' queries without their knowledge. We report two iterations of the TalkBack smart speaker, which transparently combines machine and human assistance. In the first, we created a prototype to investigate whether people would choose to forward their questions to a human answerer if the machine was unable to help. Longitudinal deployment revealed that most users would do so when given the explicit choice. In the second iteration we extended the prototype to draw upon spoken answers from previous deployments, combining machine efficiency with human richness. Deployment of this second iteration shows that this corpus can help provide relevant, human-created instant responses. We distil lessons learned for those developing conversational agents or other AI-infused systems about how to appropriately enlist human-in-the-loop information services to benefit users, task workers and system performance.

Supplementary Material

MP4 File (a183-reitmaier-presentation.mp4)

Cited By

  • (2020) CUI@CSCW: Collaborating through Conversational User Interfaces. Companion Publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing, 483-492. DOI: 10.1145/3406865.3418587. Online publication date: 17-Oct-2020

Published In

CHI '20: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems
April 2020
10688 pages
ISBN: 9781450367080
DOI: 10.1145/3313831
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. conversational agents
  2. emergent users
  3. public space interaction
  4. speech appliances

Qualifiers

  • Research-article

Funding Sources

  • EPSRC

Conference

CHI '20

Acceptance Rates

Overall Acceptance Rate 6,199 of 26,314 submissions, 24%
