Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1294211.1294238acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
Article

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes

Published: 07 October 2007 Publication History

Abstract

Although mobile, tablet, large display, and tabletop computers increasingly present opportunities for using pen, finger, and wand gestures in user interfaces, implementing gesture recognition largely has been the privilege of pattern matching experts, not user interface prototypers. Although some user interface libraries and toolkits offer gesture recognizers, such infrastructure is often unavailable in design-oriented environments like Flash, scripting environments like JavaScript, or brand new off-desktop prototyping environments. To enable novice programmers to incorporate gestures into their UI prototypes, we present a "$1 recognizer" that is easy, cheap, and usable almost anywhere in about 100 lines of code. In a study comparing our $1 recognizer, Dynamic Time Warping, and the Rubine classifier on user-supplied gestures, we found that $1 obtains over 97% accuracy with only 1 loaded template and 99% accuracy with 3+ loaded templates. These results were nearly identical to DTW and superior to Rubine. In addition, we found that medium-speed gestures, in which users balanced speed and accuracy, were recognized better than slow or fast gestures for all three recognizers. We also discuss the effect that the number of templates or training examples has on recognition, the score falloff along recognizers' N-best lists, and results for individual gestures. We include detailed pseudocode of the $1 recognizer to aid development, inspection, extension, and testing.

References

[1]
Anderson, D., Bailey, C. and Skubic, M. (2004) Hidden Markov Model symbol recognition for sketch-based interfaces. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 15--21.
[2]
Cao, X. and Balakrishnan, R. (2003) VisionWand: Interaction techniques for large displays using a passive wand tracked in 3D. Proc. UIST '03. New York: ACM Press, 173--182.
[3]
Cao, X. and Balakrishnan, R. (2005) Evaluation of an on-line adaptive gesture interface with command prediction. Proc. Graphics Interface '05. Waterloo, Ontario: CHCCS, 187--194.
[4]
Cho, M.G. (2006) A new gesture recognition algorithm and segmentation method of Korean scripts for gesture-allowed ink editor. Information Sciences 176 (9), 1290--1303.
[5]
Guimbretière, F., Stone, M. and Winograd, T. (2001) Fluid interaction with high-resolution wall-size displays. Proc. UIST '01. New York: ACM Press, 21--30.
[6]
Henry, T.R., Hudson, S.E. and Newell, G.L. (1990) Integrating gesture and snapping into a user interface toolkit. Proc. UIST '90. New York: ACM Press, 112--122.
[7]
Hinckley, K., Ramos, G., Guimbretiere, F., Baudisch, P. and Smith, M. (2004) Stitching: Pen gestures that span multiple displays. Proc. AVI '04. New York: ACM Press, 23--31.
[8]
Hong, J.I. and Landay, J.A. (2000) SATIN: A toolkit for informal ink-based applications. Proc. UIST '00. New York: ACM Press, 63--72.
[9]
Kara, L.B. and Stahovich, T.F. (2004) An image-based trainable symbol recognizer for sketch-based interfaces. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 99--105.
[10]
Karlson, A.K., Bederson, B.B. and SanGiovanni, J. (2005) AppLens and LaunchTile: Two designs for one-handed thumb use on small devices. Proc. CHI '05. New York: ACM Press, 201--210.
[11]
Kristensson, P. and Zhai, S. (2004) SHARK2: A large vocabulary shorthand writing system for pen-based computers. Proc. UIST '04. New York: ACM Press, 43--52.
[12]
Landay, J. and Myers, B.A. (1993) Extending an existing user interface toolkit to support gesture recognition. Adjunct Proc. CHI '93. New York: ACM Press, 91--92.
[13]
Lin, J., Newman, M.W., Hong, J.I. and Landay, J.A. (2000) DENIM: Finding a tighter fit between tools and practice for web site design. Proc. CHI '00. New York: ACM Press, 510--517.
[14]
Long, A.C., Landay, J.A. and Rowe, L.A. (1999) Implications for a gesture design tool. Proc. CHI '99. New York: ACM Press, 40--47.
[15]
Mitchell, T.M. (1997) Machine Learning. New York: McGraw-Hill.
[16]
Morris, M.R., Huang, A., Paepcke, A. and Winograd, T. (2006) Cooperative gestures: Multi-user gestural interactions for co-located groupware. Proc. CHI '06. New York: ACM Press, 1201--1210.
[17]
Myers, B.A., McDaniel, R.G., Miller, R.C., Ferrency, A.S., Faulring, A., Kyle, B.D., Mickish, A., Klimovitski, A. and Doane, P. (1997) The Amulet environment: New models for effective user interface software development. IEEE Trans. Software Engineering 23 (6), 347--365.
[18]
Myers, C.S. and Rabiner, L.R. (1981) A comparative study of several dynamic time-warping algorithms for connected word recognition. The Bell System Technical J. 60 (7), 1389--1409.
[19]
Notowidigdo, M. and Miller, R.C. (2004) Off-line sketch interpretation. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 120--126.
[20]
Pittman, J.A. (1991) Recognizing handwritten text. Proc. CHI '91. New York: ACM Press, 271--275.
[21]
Plamondon, R. and Srihari, S.N. (2000) On-line and off-line handwriting recognition: A comprehensive survey. IEEE Trans. Pattern Analysis & Machine Int. 22 (1), 63--84.
[22]
Press, W.H., Teukolsky, S.A., Vetterling, W.T. and Flannery, B.P. (1992) Numerical Recipes in C. Cambridge Univ. Press.
[23]
Rubine, D. (1991) Specifying gestures by example. Proc. SIGGRAPH '91. New York: ACM Press, 329--337.
[24]
Salvador, S. and Chan, P. (2004) FastDTW: Toward accurate dynamic time warping in linear time and space. 3rd Wkshp. on Mining Temporal and Sequential Data, ACM KDD '04. Seattle, Washington (August 22--25, 2004).
[25]
Sezgin, T.M. and Davis, R. (2005) HMM-based efficient sketch recognition. Proc. IUI '05. New York: ACM Press, 281--283.
[26]
Stojmenovi., M., Nayak, A. and Zunic, J. (2006) Measuring linearity of a finite set of points. Proc. CIS '06. Los Alamitos, CA: IEEE Press, 1--6.
[27]
Swigart, S. (2005) Easily write custom gesture recognizers for your Tablet PC applications. Tablet PC Technical Articles.
[28]
Tappert, C.C. (1982) Cursive script recognition by elastic matching. IBM J. of Research & Development 26 (6), 765--771.
[29]
Tappert, C.C., Suen, C.Y. and Wakahara, T. (1990) The state of the art in online handwriting recognition. IEEE Trans. Pattern Analysis & Machine Int. 12 (8), 787--808.
[30]
Vermunt, J.K. (1997) Log-linear Models for Event Histories. Thousand Oaks, CA: Sage Publications.
[31]
Wilson, A.D. and Shafer, S. (2003) XWand: UI for intelligent spaces. Proc. CHI '03. New York: ACM Press, 545--552.
[32]
Zhai, S. and Kristensson, P. (2003) Shorthand writing on stylus keyboard. Proc. CHI '03. New York: ACM Press, 97--104.

Cited By

View all
  • (2024)Hand Trajectory Recognition by Radar with a Finite-State Machine and a Bi-LSTMApplied Sciences10.3390/app1415678214:15(6782)Online publication date: 3-Aug-2024
  • (2024)Comparing Eyes-free Gestures to Gestures Produced in the Presence or Absence of Visual Feedback on Mobile DeviceProceedings of the 2024 International Conference on Advanced Visual Interfaces10.1145/3656650.3656651(1-5)Online publication date: 3-Jun-2024
  • (2024)Feminist Interaction Techniques: Social Consent Signals to Deter NCIM ScreenshotsProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3654777.3676380(1-14)Online publication date: 13-Oct-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
UIST '07: Proceedings of the 20th annual ACM symposium on User interface software and technology
October 2007
306 pages
ISBN:9781595936790
DOI:10.1145/1294211
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dynamic time warping
  2. gesture recognition
  3. marks
  4. rapid prototyping
  5. recognition rates
  6. rubine
  7. statistical classifiers
  8. strokes
  9. symbols
  10. unistrokes
  11. user interfaces

Qualifiers

  • Article

Conference

UIST07

Acceptance Rates

Overall Acceptance Rate 561 of 2,567 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)197
  • Downloads (Last 6 weeks)47
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Hand Trajectory Recognition by Radar with a Finite-State Machine and a Bi-LSTMApplied Sciences10.3390/app1415678214:15(6782)Online publication date: 3-Aug-2024
  • (2024)Comparing Eyes-free Gestures to Gestures Produced in the Presence or Absence of Visual Feedback on Mobile DeviceProceedings of the 2024 International Conference on Advanced Visual Interfaces10.1145/3656650.3656651(1-5)Online publication date: 3-Jun-2024
  • (2024)Feminist Interaction Techniques: Social Consent Signals to Deter NCIM ScreenshotsProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3654777.3676380(1-14)Online publication date: 13-Oct-2024
  • (2024)Inkeraction: An Interaction Modality Powered by Ink Recognition and SynthesisProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642498(1-26)Online publication date: 11-May-2024
  • (2024)Hands-On Robotics: Enabling Communication Through Direct Gesture ControlCompanion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction10.1145/3610978.3640635(822-827)Online publication date: 11-Mar-2024
  • (2024)Generating Virtual Reality Stroke Gesture Data from Out-of-Distribution Desktop Stroke Gesture Data2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR)10.1109/VR58804.2024.00093(732-742)Online publication date: 16-Mar-2024
  • (2024)Converting Tatamis into Touch Sensors by Measuring Capacitance2024 IEEE/SICE International Symposium on System Integration (SII)10.1109/SII58957.2024.10417676(554-558)Online publication date: 8-Jan-2024
  • (2024)Elicitation and Evaluation of Hand-based Interaction Language for 3D Conceptual Design in Mixed RealityInternational Journal of Human-Computer Studies10.1016/j.ijhcs.2023.103198183:COnline publication date: 14-Mar-2024
  • (2024)The impacts of situational visual impairment on usability of touch screensMultimedia Tools and Applications10.1007/s11042-024-18689-983:34(81685-81709)Online publication date: 9-Mar-2024
  • (2024)Pick, Click, Flick!undefinedOnline publication date: 14-Mar-2024
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media