DOI: 10.1145/3126594.3126612 · UIST Conference Proceedings · Research Article

SoundCraft: Enabling Spatial Interactions on Smartwatches using Hand Generated Acoustics

Published: 20 October 2017 Publication History

Abstract

We present SoundCraft, a smartwatch prototype embedded with a microphone array that angularly localizes, in azimuth and elevation, acoustic signatures: non-vocal acoustics produced with our hands. Acoustic signatures are common in daily life, such as when snapping or rubbing our fingers, tapping on objects, or using an auxiliary object to generate the sound. We demonstrate that our prototype can capture and leverage the spatial location of such naturally occurring acoustics. We describe our algorithm, adapted from the MUltiple SIgnal Classification (MUSIC) technique [31], which enables robust localization and classification of the acoustics even when the microphones must be placed in close proximity. SoundCraft enables a rich set of spatial interaction techniques, including quick access to smartwatch content, rapid command invocation, in-situ sketching, and multi-user around-device interaction. Via a series of user studies, we validate SoundCraft's localization and classification capabilities in both quiet and noisy environments.
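To illustrate the subspace idea behind the MUSIC technique [31] that the abstract references, the following is a minimal narrowband, azimuth-only sketch in Python/NumPy. It assumes a simulated 4-microphone uniform linear array with half-wavelength spacing and a single source; these assumptions are for illustration only, since SoundCraft's actual pipeline handles wideband hand acoustics on a watch-scale array and scans a 2D azimuth/elevation grid.

```python
import numpy as np

def steering_vector(theta, n_mics=4, spacing=0.5):
    """Array response of a uniform linear array to a plane wave
    arriving at angle theta (radians); spacing is in wavelengths."""
    k = np.arange(n_mics)
    return np.exp(-2j * np.pi * spacing * k * np.sin(theta))

def music_spectrum(snapshots, n_sources, angles):
    """MUSIC pseudo-spectrum. snapshots: (n_mics, n_samples) complex data."""
    # Sample covariance matrix across snapshots.
    R = snapshots @ snapshots.conj().T / snapshots.shape[1]
    # eigh returns eigenvalues in ascending order; the smallest
    # (n_mics - n_sources) eigenvectors span the noise subspace.
    _, eigvecs = np.linalg.eigh(R)
    En = eigvecs[:, :-n_sources]
    spec = []
    for th in angles:
        a = steering_vector(th, snapshots.shape[0])
        # Peaks occur where the steering vector is orthogonal
        # to the noise subspace.
        spec.append(1.0 / (np.linalg.norm(En.conj().T @ a) ** 2))
    return np.array(spec)

# Simulate one source at 30 degrees plus sensor noise, then scan.
rng = np.random.default_rng(0)
true_theta = np.deg2rad(30)
s = rng.standard_normal(200) + 1j * rng.standard_normal(200)
x = np.outer(steering_vector(true_theta), s)
x += 0.1 * (rng.standard_normal(x.shape) + 1j * rng.standard_normal(x.shape))

angles = np.deg2rad(np.linspace(-90, 90, 361))
spec = music_spectrum(x, n_sources=1, angles=angles)
est = np.rad2deg(angles[np.argmax(spec)])
print(f"estimated azimuth: {est:.1f} degrees")
```

The key property MUSIC exploits is that with more microphones than sources, the covariance matrix's noise eigenvectors are orthogonal to the true steering vectors, so scanning candidate angles and inverting the projection onto the noise subspace yields sharp peaks at source directions.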

Supplementary Material

suppl.mov (uistf2014-file3.mp4)
Supplemental video

References

[1]
Brian Amento, Will Hill, and Loren Terveen. 2002. The sound of one hand: a wrist-mounted bio-acoustic fingertip gesture interface. In CHI '02 Extended Abstracts on Human Factors in Computing Systems (CHI EA '02). ACM, NY, NY, USA, 724--725.
[2]
Daniel L. Ashbrook. 2010. Enabling Mobile Microinteractions. Ph.D. Dissertation. Georgia Institute of Technology, Atlanta, GA, USA. AAI3414437.
[3]
Wei-Hung Chen. 2015. Blowatch: Blowable and Handsfree Interaction for Smartwatches. In Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA '15). 103--108.
[4]
Jackson Feijó Filho, Wilson Prata, and Thiago Valle. 2015. Advances on Breathing Based Text Input for Mobile Devices. In International Conference on Universal Access in Human-Computer Interaction. Springer International Publishing, 279--287.
[5]
Mathieu Le Goc, Stuart Taylor, Shahram Izadi, and Cem Keskin. 2014. A low-cost transparent electric field sensor for 3d interaction on mobile devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '14). 3167--3170.
[6]
Randy Gomez, Tatsuya Kawahara, and Kazuhiro Nakadai. 2015. Optimized wavelet-domain filtering under noisy and reverberant conditions. APSIPA Transactions on Signal and Information Processing, 4, e3.
[7]
Susumu Harada, James A. Landay, Jonathan Malkin, Xiao Li, and Jeff A. Bilmes. 2006. The vocal joystick: evaluation of voice-based cursor control techniques. In Proceedings of the 8th international ACM SIGACCESS conference on Computers and accessibility (Assets '06). ACM, NY, NY, USA, 197--204.
[8]
Chris Harrison and Scott E. Hudson. 2008. Scratch input: creating large, inexpensive, unpowered and mobile finger input surfaces. In Proceedings of the 21st annual ACM symposium on User interface software and technology (UIST '08). 205--208.
[9]
Chris Harrison and Scott E. Hudson. 2009. Abracadabra: wireless, high-precision, and unpowered finger input for very small mobile devices. In Proceedings of the 22nd annual ACM symposium on User interface software and technology (UIST '09). 121--124.
[10]
Chris Harrison, Desney Tan, and Dan Morris. 2010. Skinput: appropriating the body as an input surface. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '10). 453--462.
[11]
Chris Harrison, Robert Xiao, and Scott Hudson. 2012. Acoustic barcodes: passive, durable and inexpensive notched identification tags. In Proceedings of the 25th annual ACM symposium on User interface software and technology (UIST '12). 563--568.
[12]
Khalad Hasan, David Ahlström, and Pourang Irani. 2015. SAMMI: A Spatially-Aware Multi-Mobile Interface for Analytic Map Navigation Tasks. In Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI '15). ACM, NY, NY, USA, 36--45.
[13]
Takeo Igarashi and John F. Hughes. 2001. Voice as sound: using non-verbal voice input for interactive control. In Proceedings of the 14th annual ACM symposium on User interface software and technology (UIST '01). ACM, NY, NY, USA, 155--156.
[14]
Hiroshi Ishii, Craig Wisneski, Julian Orbanes, Ben Chun, and Joe Paradiso. 1999. PingPongPlus: design of an athletic-tangible interface for computer-supported cooperative play. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems (CHI '99). 394--401.
[15]
Hamed Ketabdar, Mehran Roshandel, and Kamer Ali Yüksel. 2010. MagiWrite: towards touchless digit entry using 3D space around mobile devices. In Proceedings of the 12th international conference on Human computer interaction with mobile devices and services (MobileHCI '10). ACM, NY, NY, USA, 443--446.
[16]
David Kim, Otmar Hilliges, Shahram Izadi, Alex D. Butler, Jiawen Chen, Iason Oikonomidis, and Patrick Olivier. 2012. Digits: freehand 3D interactions anywhere using a wrist-worn gloveless sensor. In Proceedings of the 25th annual ACM symposium on User interface software and technology (UIST '12). 167--176.
[17]
Jungsoo Kim, Jiasheng He, Kent Lyons, and Thad Starner. 2007. The Gesture Watch: A Wireless Contactfree Gesture based Wrist Interface. In Proceedings of the 2007 11th IEEE International Symposium on Wearable Computers (ISWC '07). IEEE Computer Society. 1--8.
[18]
Sven Kratz and Michael Rohs. 2009. Hoverflow: exploring around-device interaction with IR distance sensors. In Proceedings of the 11th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI '09). Article 42, 4 pages.
[19]
MindMeld Launches Voice Assistant 2.0, Says Voice Search Growing Dramatically. 2015. Retrieved August 25, 2016 from http://searchengineland.com/mindmeldlaunches-voice-assistant-2-0-says-voice-searchgrowing-dramatically-238130
[20]
Gierad Laput, Robert Xiao, and Chris Harrison. 2016. ViBand: High-Fidelity Bio-Acoustic Sensing Using Commodity Smartwatch Accelerometers. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16). ACM, NY, NY, USA, 321--333.
[21]
Pedro Lopes, Ricardo Jota, and Joaquim A. Jorge. 2011. Augmenting touch interaction through acoustic sensing. In Proceedings of the ACM International Conference on Interactive Tabletops and Surfaces (ITS '11). ACM, NY, NY, USA, 53--56.
[22]
Keisuke Nakamura, Kazuhiro Nakadai, Futoshi Asano, Yuji Hasegawa, and Hiroshi Tsujino. 2009. Intelligent sound source localization for dynamic environments. IEEE/RSJ International Conference on Intelligent Robots and Systems, St. Louis, MO, pp. 664--669.
[23]
OMG! Mobile voice survey reveals teens love to talk. 2014. Retrieved August 25, 2016 from https://googleblog.blogspot.ca/2014/10/omg-mobilevoice-survey-reveals-teens.html
[24]
Joseph A. Paradiso and Che King Leo. 2005. Tracking and characterizing knocks atop large interactive displays. Sensor Review, Vol. 25 Iss: 2, pp.134--143.
[25]
Shwetak N. Patel and Gregory D. Abowd. 2007. Blui: low-cost localized blowable user interfaces. In Proceedings of the 20th annual ACM symposium on User interface software and technology (UIST '07). 217--220.
[26]
D. T. Pham, Ze Ji, Ming Yang, Zuobin Wang, and Mostafa Al-Kutubi. 2007. A novel human-computer interface based on passive acoustic localization. In Proceedings of the 12th international conference on Human-computer interaction: interaction platforms and techniques (HCI'07), Julie A. Jacko (Ed.). Springer-Verlag, Berlin, Heidelberg, 901--909.
[27]
Nissanka B. Priyantha, Anit Chakraborty, and Hari Balakrishnan. 2000. The Cricket location-support system. In Proceedings of the 6th annual international conference on Mobile computing and networking (MobiCom '00). ACM, NY, NY, USA, 32--43.
[28]
Jun Rekimoto. 2001. Gesturewrist and gesturepad: Unobtrusive wearable interaction devices. In Proceedings of Fifth International Symposium on Wearable Computers, 21--27. IEEE.
[29]
Gabriel Reyes, Dingtian Zhang, Sarthak Ghosh, Pratik Shah, Jason Wu, Aman Parnami, Bailey Bercik, Thad Starner, Gregory D. Abowd, and W. Keith Edwards. 2016. Whoosh: non-voice acoustics for low-cost, handsfree, and rapid input on smartwatches. In Proceedings of the 2016 ACM International Symposium on Wearable Computers (ISWC '16). ACM, NY, NY, USA, 120--127.
[30]
Daisuke Sakamoto, Takanori Komatsu, and Takeo Igarashi. 2013. Voice augmented manipulation: using paralinguistic information to manipulate mobile devices. In Proceedings of the 15th international conference on Human-computer interaction with mobile devices and services (MobileHCI '13). 69--78.
[31]
R. Schmidt. 1986. Multiple emitter location and signal parameter estimation. In IEEE Transactions on Antennas and Propagation, vol. 34, no. 3, pp. 276--280.
[32]
Project Soli. 2016. Retrieved August 25, 2016 from http://atap.google.com/soli.
[33]
Jie Song, Gábor Sörös, Fabrizio Pece, Sean Ryan Fanello, Shahram Izadi, Cem Keskin, and Otmar Hilliges. 2014. In-air gestures around unmodified mobile devices. In Proceedings of the 27th annual ACM symposium on User interface software and technology (UIST '14). 319--329.
[34]
Adam J. Sporka, Sri H. Kurniawan, Murni Mahmud, and Pavel Slavík. 2006. Non-speech input and speech recognition for real-time control of computer games. In Proceedings of the 8th international ACM SIGACCESS conference on Computers and accessibility (Assets '06). 213--220.
[35]
T. Scott Saponas, Desney S. Tan, Dan Morris, Ravin Balakrishnan, Jim Turner, and James A. Landay. 2009. Enabling always-available input with muscle-computer interfaces. In Proceedings of the 22nd annual ACM symposium on User interface software and technology (UIST '09). 167--176.
[36]
Wouter Van Vlaenderen, Jens Brulmans, Jo Vermeulen, and Johannes Schöning. 2015. WatchMe: A Novel Input Method Combining a Smartwatch and Bimanual Interaction. In Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA '15). 2091--2095.
[37]
Cheng-Yao Wang, Min-Chieh Hsiu, Po-Tsung Chiu, Chiao-Hui Chang, Liwei Chan, Bing-Yu Chen, and Mike Y. Chen. 2015. PalmGesture: Using Palms as Gesture Interfaces for Eyes-free Input. In Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI '15). 217--226.
[38]
Hongyi Wen, Julian Ramos Rojas, and Anind K. Dey. 2016. Serendipity: Finger Gesture Recognition using an Off-the-Shelf Smartwatch. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). 3847--3851.
[39]
Anusha Withana, Roshan Peiris, Nipuna Samarasekara, and Suranga Nanayakkara. 2015. zSense: Enabling Shallow Depth Gesture Recognition for Greater Input Expressivity on Smart Wearables. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). 3661--3670.
[40]
B. D. Van Veen and K. M. Buckley. 1988. Beamforming: a versatile approach to spatial filtering. In IEEE ASSP Magazine, vol. 5, no. 2, pp. 4--24.
[41]
Robert Xiao, Greg Lew, James Marsanico, Divya Hariharan, Scott Hudson, and Chris Harrison. 2014. Toffee: enabling ad hoc, around-device interaction with acoustic time-of-arrival correlation. In Proceedings of the 16th international conference on Human-computer interaction with mobile devices & services (MobileHCI '14). 67--76.
[42]
Cheng Zhang, Qiuyue Xue, Anandghan Waghmare, Sumeet Jain, Yiming Pu, Sinan Hersek, Kent Lyons, Kenneth A. Cunefare, Omer T. Inan, and Gregory D. Abowd. 2017. SoundTrak: Continuous 3D Tracking of a Finger Using Active Acoustics. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 2, Article 30 (June 2017), 25 pages.
[43]
Yang Zhang and Chris Harrison. 2015. Tomo: Wearable, Low-Cost Electrical Impedance Tomography for Hand Gesture Recognition. In Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology (UIST '15). 167--173.
[44]
Yang Zhang, Junhan Zhou, Gierad Laput, and Chris Harrison. 2016. SkinTrack: Using the Body as an Electrical Waveguide for Continuous Finger Tracking on the Skin. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). 1491--1503.



    Published In

    UIST '17: Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology
    October 2017
    870 pages
ISBN: 9781450349819
DOI: 10.1145/3126594

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. acoustic signatures
    2. classification and localization
    3. microphone array processing
    4. smartwatch spatial input

    Qualifiers

    • Research-article

    Funding Sources

    • Honda Research Institute
    • NSERC
    • Mitacs

    Conference

    UIST '17

    Acceptance Rates

UIST '17 Paper Acceptance Rate: 73 of 324 submissions, 23%
Overall Acceptance Rate: 561 of 2,567 submissions, 22%

    Cited By

• (2024) KeyFlow: Acoustic Motion Sensing for Cursor Control on Any Keyboard. Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 1--3. DOI: 10.1145/3672539.3686348. 13 Oct 2024.
• (2024) PalmSpace. International Journal of Human-Computer Studies, 184:C. DOI: 10.1016/j.ijhcs.2024.103219. 1 Apr 2024.
• (2023) Brave New GES World: A Systematic Literature Review of Gestures and Referents in Gesture Elicitation Studies. ACM Computing Surveys, 56:5, 1--55. DOI: 10.1145/3636458. 7 Dec 2023.
• (2023) A Design Framework For Equitable Wearables. Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing, 241--245. DOI: 10.1145/3594739.3610766. 8 Oct 2023.
• (2023) InstruMentAR: Auto-Generation of Augmented Reality Tutorials for Operating Digital Instruments Through Recording Embodied Demonstration. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 1--17. DOI: 10.1145/3544548.3581442. 19 Apr 2023.
• (2023) AO-Finger: Hands-free Fine-grained Finger Gesture Recognition via Acoustic-Optic Sensor Fusing. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 1--14. DOI: 10.1145/3544548.3581264. 19 Apr 2023.
• (2023) Listening Human Behavior: 3D Human Pose Estimation with Acoustic Signals. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13323--13332. DOI: 10.1109/CVPR52729.2023.01280. Jun 2023.
• (2022) Are You Wearing a Mask? Detecting If a Person Wears a Mask Using a Wristband. Sensors, 22:5, 1745. DOI: 10.3390/s22051745. 23 Feb 2022.
• (2022) Leveraging Smartwatch and Earbuds Gesture Capture to Support Wearable Interaction. Proceedings of the ACM on Human-Computer Interaction, 6:ISS, 31--50. DOI: 10.1145/3567710. 14 Nov 2022.
• (2022) EdgeSelect: Smartwatch Data Interaction with Minimal Screen Occlusion. Proceedings of the 2022 International Conference on Multimodal Interaction, 288--298. DOI: 10.1145/3536221.3556586. 7 Nov 2022.
