Abstract
Human hand recognition plays an important role in a wide range of applications ranging from sign language translators, gesture recognition, augmented reality, surveillance and medical image processing to various Human Computer Interaction (HCI) domains. Human hand is a complex articulated object consisting of many connected parts and joints. Therefore, for applications that involve HCI one can find many challenges to establish a system with high detection and recognition accuracy for hand posture and/or gesture. Hand posture is defined as a static hand configuration without any movement involved. Meanwhile, hand gesture is a sequence of hand postures connected by continuous motions. During the past decades, many approaches have been presented for hand posture and/or gesture recognition. In this paper, we provide a survey on approaches which are based on Hidden Markov Models (HMM) for hand posture and gesture recognition for HCI applications.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Aaron F, Andrew D (1997) A state-based approach to the representation and recognition of gesture. IEEE Trans Pattern Anal and Mach Intell 19(12): 1325–1337
Aas K, Eikvil L, Huseby RB et al (1999) Application of hidden Markov chains in image analysis. Pattern Recognit 32(4): 703
AL-Rousan M, Assaleh K, AL-Rousan T (2009) Video-based signer-independent Arabic sign language recognition using hidden Markov models. Appl Soft Comput 9(3): 990–999
Aran O, Ari I, Akarun L, Sankur B, Benoit A, Caplier A, Campr P, Carrillo A, Fanarda F (2009) Sign tutor: an interactive system for sign language tutoring. IEEE Multimed 1: 81–93
Assaleh K, Shanableh T, Fanaswala M, Amin F, Bajaj H (2010) Continuous Arabic sign language recognition in user dependent mode. J Intell Learn Syst Appl 2: 19–27
Bauer B, Kraiss K-F (2001) Towards an automatic sign language recognition system using subunits. Gesture workshop, pp 64–75
Bilal S, Akmelawati R, Salami MJE, Shafie AA, Bouhabba EM (2010) A hybrid method using haar-like and skin-color algorithm for hand posture detection, recognition and tracking. In: Proceedings of international conference on mechatronics and automation (ICMA), Xi’an, August 2010, pp 934–939
Black MJ, Jepson AD (1998) A probabilistic framework for matching temporal trajectories: condensation-based recognition of gesture and expressions. In: Proceedings fifth European conference computer vision, pp 909–924
Braffort A (1996) ARGO: an architecture for sign language recognition and interpretation. In: Harling P, Edwards A (eds) Progress in gestural interaction. Springer, Berlin, pp 17–30
Brashear H, Park K-H, Lee S, Henderson V, Hamilton H, Starner T (2006) American sign language recognition in game development for deaf children. In: Proceedings of the Assets, pp 79–86
Del Rose MS, Wagner CC (2011) Survey on classifying human actions through visual sensors. J Artif Intell Rev. doi:10.1007/s10462-011-9232-z
Derpanis KG (2004) A review of vision-based hand gestures. Internal report, Department of Computer Science, York University
Elmezain M, Al-Hamadi A, Appenrodt J, Michaelis B (2008a) A hidden Markov model-based continuous gesture recognition system for hand motion trajectory. In: 19th international conference on pattern recognition, ICPR 2008, pp 1–4
Elmezain M, Al-Hamadi A, Michaelis B (2008b) Real-time capable system for hand gesture recognition using hidden markov models in stereo color image sequences. An international journal of algorithms, data structures and techniques for computer graphics and visualization, modeling, CAD & GIS systems, computer vision, image processing and pattern recognition, human computer interaction, animation and virtual reality, multimedia systems and applications in parallel, distributed and mobile environment (JWSCG) 2008 16(1): 65–72
Elmezain M, Al-Hamadi A, Appenrodt J, Michaelis B (2009) A hidden Markov model-based isolated and meaningful hand gesture recognition. Int J Electr Comput Syst Eng 3:3
Fang GL, Gao W (2002) A SRN/HMM system for signer independent continuous sign language recognition. In: Proceedings of the fifth international conference on automatic face and gesture recognition, pp 312–317
Fang GL, Gao XJ, Gao W, Chen YQ (2004) A novel approach to automatically extracting basic units from Chinese sign language. In: 17th international conference on pattern recognition (ICPR), Cambridge, England, pp 454–457
Fels SS, Hinton GE (1993) Glove-talk: a neural network interface between a data-glove and a speech synthesizer. IEEE Trans Neural Netw 4(1): 2–8
Fels SS, Hinton GE (1997) \({^{\rm \underline{a}}}\) Glove-talk II: “a neural network interface which maps gestures to parallel format speech synthesizer controls”. IEEE Trans Neural Netw 9(1): 205–212
Garg P, Aggarwal N, Sofat S (2009) Vision based hand gesture recognition. World Acad Sci Eng Technol 49: 973–977
Gao W, Ma JY, Wu JQ, Wang CL (2000) Sign language recognition based on HMM/ANN/DP. Int J Pattern Recognit Artif Intell 14(5): 587–602
Gao W, Fang G, Zhao D, Chen Y (2004) Transition movement models for large vocabulary continuous sign language recognition. In: Proceedings of the sixth IEEE international conference automatic face and gesture recognition, pp 553–558
Gao W, Fanga G, Zhaoa D, Chen Y (2004) A Chinese sign language recognition system based on SOFM/SRN/HMM. Pattern Recognit 37:2389–2402
Grobel K, Assan M (1997) Isolated sign language recognition using hidden Markov models. In: IEEE international conference on systems, man, and cybernetics, computational cybernetics and simulation, vol 1. pp 162–167
Gruenstein A (2002) Two methods of gesture recognition. Retrieved from http://www.mit.edu/~alexg/vision/review.pdf
Hidden Markov Model Toolkit (HTK)—Speech Recognition toolkit, (2011). Retrieved from June, 2011. http://htk.eng.cam.ac.uk/
Holden EJ, Lee G, Owens R (2005) Australian sign language recognition. Mach Vis Appl 16: 312–320
Hunter E, Schlenzig J, Jain R (1995) Posture estimation in reduced-model gesture input systems. In: Proceedings of the international workshop on automatic face-and gesture-recognition, pp 290–295
Isard M, Blake A (1998) Condensation—conditional density propagation for visual tracking. Int J Comput Vis 29(1): 5–28
Just A, Bernier O, Marcel S (2004) HMM and IOHMM for the recognition of mono- and bi-manual 3D hand gestures. In: British machine vision conference (BMVC)
Keskin C, Erkan A, Akarun L (2003) Real time hand tracking and 3D gesture recognition for interactive interfaces using HMM. In: Proceedings of international conference on artificial neural networks
Kim J-B, Park K-H, Bang W-C, Bien ZZ (2002a) Continuous korean sign language recognition using gesture segmentation and hidden markov modeL. In: RESNA 25th international conference on technology disability minneapolis USA
Kim J-B, Park K-H, Bang W-C, Bien ZZ (2002b) Continuous gesture recognition system for Korean sign language based on fuzzy logic and hidden Markov model. In: Proceedings of the 2002 IEEE international conference on fuzzy systems, FUZZ-IEEE’02, USA, pp. 1574–1579
Kobayashi T, Haruyama S (1997) Partly-hidden Markov model and its application to gesture recognition. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP) vol 4, pp 3081–3084
Lee H-K, Kim JH (1999) An HMM-based threshold model approach for gesture recognition. IEEE Trans Pattern Anal Mach Intell 21(10): 961–973
Liang R-H, Ouhyoung M (1996) A sign language recognition system using hidden Markov model and context sensitive search. In: Proceedings of the ACM symposium on virtual reality software and technology. ACM Press, pp 59–66
Liang R-H, Ouhyoung M (1998) A real-time continuous gesture recognition system for sign language. In: Proceedings of the third IEEE international conference on automatic face and gesture recognition, 1998, pp 558–567
Liddell SK, Johnson RE (1989) American sign language: the phonological base. Sign Lang Stud 64: 195–277
Ma J, Gao W, Wu J, Wang C (2000) A continuous Chinese sign language recognition system, In: Proceedings fourth IEEE international conference on automatic face and gesture recognition, 2000, pp 428–433
Maebatake M, Suzuki I, Nishida M, Horiuchi Y, Kuroiwa S (2008) Sign language recognition based on position and movement using multi-stream HMM. In: Second international symposium on universal communication
Manresa C, Varona J, Mas R, Perales FJ (2000) Real-time hand tracking and gesture recognition for human-computer interaction. Electron Lett Comput Vis Image Anal 0(0): 1–7
Mitra S, Acharya T (2007) Gesture recognition: a survey, systems, man, and cybernetics, part C: applications and reviews. IEEE Trans 37: 311–324
Munib Q, Habeeb M, Takruri B, Al-Malik HA (2007) American sign language (ASL) recognition based on hough transform and neural networks. Expert Syst Appl 32: 24–37
Murthy GRS, Jadon RS (2009) A review on vision based Hand gestures recognition. Int J Inf Technol Knowl Manag 2: 405–410
Nam Y, Wohn KY (1996) Recognition of space-time hand-gestures using hidden Markov model. In: ACM symposium on virtual reality software and technology, pp 51–58
Nam Y, Wohn KY (1999) Recognition and modeling of hand gestures using colored petri nets. IEEE Trans Syst Man Cybern 29: 514–521
Nguyen DB, Enokida S, Toshiaki E (2005) Real-time hand tracking and gesture recognition system. IGVIP05 conference, CICC, pp 362–368
Nianjun L, Brian CL, Peter JK, Richard AD (2004) Model structure selection & training algorithms for a HMM gesture recognition system. In: International IWFHR, pp 100–106
Philippine Federation of the Deaf (2005) Filipino sign language: a compilation of signs from regions of the Philippines, Part 1. LFS Printing Services, Inc., Quezon City
Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2): 257–285
Ramamoorthya A, Vaswania N, Chaudhurya S, Banerjee S (2003) Recognition of dynamic hand gestures. Pattern Recognit 36: 2069–2081
Rigoll G, Kosmala A (1997) New improved feature extraction methods for real-time high performance image sequence recognition. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP), Munich, pp 2901–2904
Rigoll G, Kosmala A, Eickeler S (1997) High performance real-time gesture recognition using hidden Markov models. In: Proceedings of the Gesture workshop, Bielefeld, Germany
Rigoll G, Kosmala A, Eickeler S (1998) Hidden Markov model based continuous online gesture recognition. In: Internation conference on pattern recognition (ICPR), vol 2. pp 1206–1208
Sandjaja IN, Marcos N (2009) Sign language number recognitio. INC, IMS and IDC, NCM. In: Fifth international joint conference, pp 1503–1508
Starner T, Pentland A (1995a) Visual recognition of American sign language using hidden Markov models. Technical report TR-306, Media Lab, MIT
Starner T, Pentland A (1995b) Real-time American sign language recognition from video using hidden Markov models. Technical report TR-306, Media Lab, MIT
Tanibata N, Shimada N (2002) Extraction of hand features for recognition of sign language words. In: International conference on vision interface, pp 391–398
The Georgia Tech Gesture Toolkit (GT2k) (2011). Retrieved from June, 2011. http://gt2k.cc.gatech.edu/
Vassilia NP, Konstantinos GM (2003) On feature extraction and sign recognition for greek sign language. In: Proceedings of the 7th IASTED international conference artificial intelligence and soft computer, pp 93–98
Vogler C, Metaxas D (1997) Adapting hidden Markov models for ASL recognition by using three-dimensional computer vision methods. In: Proceedings of the IEEE international conference on systems, man and cybernetics, pp 156–161
Vogler C, Metaxas D (1997) Adapting hidden Markov models for ASL recognition by using three-dimensional computer vision methods. In: Proceedings of the IEEE international conference on systems, man and cybernetics, Orlando, FL, pp 156–161
Vogler C, Metaxas D (1998) ASL recognition based on a coupling between HMMs and 3D motion analysis. In: Proceedings of the international conference on computer vision. Mumbai, India, pp 363–369, January 4–7
Vogler C, Metaxas D (1999a) Toward scalability in ASL recognition: breaking down signs into phonemes. In: Gesture-based communication in human-computer interaction vol 1739, Lecture notes in artificial intelligence. Springer, Berlin, pp 211–224
Vogler C, Metaxas D (1999b) Parallel hidden Markov models for American sign language Recognition. In: Proceedings of the IEEE international conference on computer vision, Kerkyra, Greece, pp 116–122
Vogler C, Metaxas D (2000) A framework for recognizing the simultaneous aspects of American sign language. J Comput Vis Image Underst 81(3): 358–384
Wu Y, Huang TS (2001) Hand modeling analysis and recognition for vision-based human computer interaction. IEEE Signal Process Mag Special Issue Immers Interact Tech 18(3): 51–60
Yamato J, Ohya J, Ishii K (1992) Recognizing human action in time-sequential images using hidden Markov models. In: Proceedings of computer vision and pattern recognition (CVPR), Champaign, IL, pp 379–385
Yamato J, Ohya J, Ishii K (1992) Recognizing human action in time sequential images using hidden Markov model. In: Proceedings of the IEEE international conference computer vision and pattern recognition, Champaign, IL, pp 379–385
Yang M-H, Ahuja N (1999) Recognizing hand gesture using motion trajectories. In: IEEE CS conference on computer vision and pattern recognition, vol 1. pp 466–472
Yoon H-S, Soh J, Bae YJ, Yang HS (2001) Hand gesture recognition using combined features of location, angle and velocity. Pattern Recognit 34:1491–1501
Zafrulla Z, Brashear H, Yin P, Presti P, Starner T, Hamilton H (2010) American sign language phrase verification in an educational game for deaf children. In: International conference on pattern recognition, pp 3846–3849
Zhang L-G, Chen Y, Fang G, Chen X, GaoW(2005a) A vision-based sign language recognition system using tied-mixture density HMM. In: Proceedings of the 6th international conference on multimodal interfaces (ICMI)
Zhang L-G, Chen X, Wang C, Chen Y, Gao W (2005b) Recognition of sign language subwords based on boosted hidden Markov models. In: Proceedings of the 6th international conference on multimodal interfaces (ICMI)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bilal, S., Akmeliawati, R., Shafie, A.A. et al. Hidden Markov model for human to computer interaction: a study on human hand gesture recognition. Artif Intell Rev 40, 495–516 (2013). https://doi.org/10.1007/s10462-011-9292-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-011-9292-0