
Fast and Precise Touch-Based Text Entry for Head-Mounted Augmented Reality with Variable Occlusion

Published: 13 December 2018

Abstract

We present the VISAR keyboard: an augmented reality (AR) head-mounted display (HMD) system that supports text entry via a virtualised input surface. Users select keys on the virtual keyboard by imitating the process of single-hand typing on a physical touchscreen display. Our system uses a statistical decoder to infer users’ intended text and to provide error-tolerant predictions. A high-precision fall-back mechanism lets users indicate which keys the auto-correction process should leave unmodified. A unique advantage of leveraging the well-established touch input paradigm is that our system enables text entry with minimal visual clutter on the see-through display, thus preserving the user’s field-of-view. We iteratively designed and evaluated our system and show that its final iteration supports a mean entry rate of 17.75 wpm with a mean character error rate of less than 1%. This performance represents a 19.6% improvement relative to the state-of-the-art baseline investigated: a gaze-then-gesture text entry technique derived from the system keyboard on the Microsoft HoloLens. Finally, we validate that the system is effective in supporting text entry in a fully mobile usage scenario likely to be encountered in industrial applications of AR HMDs.
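
The two metrics quoted in the abstract, entry rate in words per minute (wpm) and character error rate, can be illustrated with a minimal sketch. It assumes the definitions conventional in text entry research (one word equals five characters, and error rate is the edit distance to the reference text divided by the reference length); the abstract does not state which exact formulas the authors used, and the function names below are hypothetical.

```python
def words_per_minute(transcribed: str, seconds: float) -> float:
    """Entry rate, assuming the conventional five-characters-per-word rule."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Timing conventionally starts at the first keystroke, so the first
    # character is not counted: hence len(transcribed) - 1.
    return (len(transcribed) - 1) / seconds * 60.0 / 5.0


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (insertions, deletions, substitutions cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, transcribed: str) -> float:
    """Character error rate in percent, relative to the reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return 100.0 * edit_distance(reference, transcribed) / len(reference)
```

Under these assumed definitions, an 11-character phrase entered in 12 seconds corresponds to 10 wpm, and a single wrong character in that phrase gives an error rate of roughly 9.1%.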





    Published In

    ACM Transactions on Computer-Human Interaction, Volume 25, Issue 6
    December 2018
    236 pages
    ISSN: 1073-0516
    EISSN: 1557-7325
    DOI: 10.1145/3300063
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 December 2018
    Accepted: 01 June 2018
    Revised: 01 April 2018
    Received: 01 September 2017
    Published in TOCHI Volume 25, Issue 6


    Author Tags

    1. Augmented reality
    2. text entry

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • Google Faculty Research Award
    • Trimble Fund
    • Google Faculty Research Award and EPSRC


    Article Metrics

    • Downloads (Last 12 months): 238
    • Downloads (Last 6 weeks): 14
    Reflects downloads up to 18 Nov 2024


    Cited By

    • (2024) Evaluating Typing Performance in Different Mixed Reality Manifestations using Physiological Features. Proceedings of the ACM on Human-Computer Interaction 8:ISS (377-406). DOI: 10.1145/3698142. Online publication date: 24-Oct-2024
    • (2024) OnArmQWERTY: An Empirical Evaluation of On-Arm Tap Typing for AR HMDs. Proceedings of the 2024 ACM Symposium on Spatial User Interaction (1-12). DOI: 10.1145/3677386.3682084. Online publication date: 7-Oct-2024
    • (2024) The Impact of Gaze and Hand Gesture Complexity on Gaze-Pinch Interaction Performances. Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing (622-626). DOI: 10.1145/3675094.3678990. Online publication date: 5-Oct-2024
    • (2024) StegoType: Surface Typing from Egocentric Cameras. Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology (1-14). DOI: 10.1145/3672539.3686762. Online publication date: 13-Oct-2024
    • (2024) TipTopTyping: A Thumb-to-Finger Text Input Method and Character Layout Optimized for Mobile Spatial Computing. Proceedings of Mensch und Computer 2024 (196-206). DOI: 10.1145/3670653.3670669. Online publication date: 1-Sep-2024
    • (2024) StegoType: Surface Typing from Egocentric Cameras. Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology (1-14). DOI: 10.1145/3654777.3676343. Online publication date: 13-Oct-2024
    • (2024) TouchInsight: Uncertainty-aware Rapid Touch and Text Input for Mixed Reality from Egocentric Vision. Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology (1-16). DOI: 10.1145/3654777.3676330. Online publication date: 13-Oct-2024
    • (2024) TouchEditor. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 7:4 (1-29). DOI: 10.1145/3631454. Online publication date: 12-Jan-2024
    • (2024) SkiMR: Dwell-free Eye Typing in Mixed Reality. 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR) (439-449). DOI: 10.1109/VR58804.2024.00065. Online publication date: 16-Mar-2024
    • (2024) Gesture2Text: A Generalizable Decoder for Word-Gesture Keyboards in XR Through Trajectory Coarse Discretization and Pre-Training. IEEE Transactions on Visualization and Computer Graphics 30:11 (7118-7128). DOI: 10.1109/TVCG.2024.3456198. Online publication date: 1-Nov-2024
