research-article

Fast and Precise Touch-Based Text Entry for Head-Mounted Augmented Reality with Variable Occlusion

Authors: John J. Dudley, Keith Vertanen, Per Ola Kristensson

ACM Transactions on Computer-Human Interaction (TOCHI), Volume 25, Issue 6

Article No.: 30, Pages 1 - 40

https://doi.org/10.1145/3232163

Published: 13 December 2018

Abstract

We present the VISAR keyboard: an augmented reality (AR) head-mounted display (HMD) system that supports text entry via a virtualised input surface. Users select keys on the virtual keyboard by imitating the process of single-hand typing on a physical touchscreen display. Our system uses a statistical decoder to infer users’ intended text and to provide error-tolerant predictions. A high-precision fall-back mechanism also lets users indicate which keys should be left unmodified by the auto-correction process. A unique advantage of leveraging the well-established touch input paradigm is that our system enables text entry with minimal visual clutter on the see-through display, thus preserving the user’s field-of-view. We iteratively designed and evaluated our system and show that its final iteration supports a mean entry rate of 17.75 wpm with a mean character error rate of less than 1%. This performance represents a 19.6% improvement relative to the state-of-the-art baseline investigated: a gaze-then-gesture text entry technique derived from the system keyboard on the Microsoft HoloLens. Finally, we validate that the system is effective in supporting text entry in a fully mobile usage scenario likely to be encountered in industrial applications of AR HMDs.
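The abstract reports an entry rate in words per minute (wpm) and a character error rate (CER) without defining them. The sketch below shows how these two metrics are conventionally computed in text entry research, assuming a word is five characters and that CER is the Levenshtein distance between the reference and transcribed text divided by the reference length. The function names are illustrative, and the paper's exact formulas may differ.

```python
def words_per_minute(transcribed: str, seconds: float) -> float:
    """Entry rate, assuming the common convention of 5 characters per word.

    The first character is excluded because timing typically starts
    when it is entered (an assumption, not taken from the paper).
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return ((len(transcribed) - 1) / seconds) * 60.0 / 5.0


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, transcribed: str) -> float:
    """CER as a percentage of the reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return 100.0 * edit_distance(reference, transcribed) / len(reference)
```

For example, under these assumptions `words_per_minute("hello world", 12.0)` gives roughly 10 wpm, and a single substituted character in an 11-character phrase gives a CER of about 9.1%.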



Index Terms

1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction techniques
      1. Text input


Published In

ACM Transactions on Computer-Human Interaction Volume 25, Issue 6

December 2018

236 pages

ISSN:1073-0516

EISSN:1557-7325

DOI:10.1145/3300063

Editor:
Ken Hinckley
Microsoft Research

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 December 2018

Accepted: 01 June 2018

Revised: 01 April 2018

Received: 01 September 2017

Published in TOCHI Volume 25, Issue 6

Qualifiers

Research-article
Research
Refereed

Funding Sources

Google Faculty Research Award
Trimble Fund
Google Faculty Research Award and EPSRC
