Abstract
Since technology is growing in all fields, researchers are developing more advanced technologies in human–computer communication and security systems. The flexible wrist hinge and crowded background make it difficult to distinguish hands in uncontrolled conditions. We proposed a deep attention network, that can automatically localize the hand and classify the gestures. Available datasets were collected to train the deep neural network architecture to create a gesture recognition system. The datasets used are 20-bn-jester and NTU-HD datasets. 20-bn-jester dataset is RGB dataset and NTU-HD dataset is RGB-D dataset. The features are extracted from the dataset and fed into the models for training and classification. Two models Resnet-50 and VGG-16 are used for training and classification. In this work, a special network used for localizing hand regions is a deep attention network. Using an attention network, gesture recognition system has been created for static RGB-D images. The proposed architecture yielded a better accuracy of 91.32% using Resnet-50, and 89% using VGG-16, respectively. The significance of these results shows that one can create a hand gesture recognition system feasibly and prominently.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Haria A, Subramanian A, Asokkumar N, Poddar S, Nayak JS. Hand gesture recognition for human computer interaction. Procedia Comput Sci. 2017;115:367–74.
Alnaim N, Maysam A, Abdulrahman A. Hand gesture recognition using convolutional neural network for people who have experienced a stroke. In: 2019 3rd international symposium on multidisciplinary studies and innovative technologies (ISMSIT), IEEE; 2019. pp. 1–6.
Deng X, Zhang Y, Yang S, Tan P, Chang L, Yuan Y, Wang H. Joint hand detection and rotation estimation using CNN. IEEE Trans Image Process. 2017;27(4):1888–900.
Zhan F. Hand gesture recognition with convolution neural networks. In: 2019 IEEE 20th international conference on information reuse and integration for data science (IRI), IEEE; 2019. pp. 295–8.
Chung H-Y, Yao-Liang C, Wei-Feng T. An efficient hand gesture recognition system based on deep CNN.” In: 2019 IEEE international conference on industrial technology (ICIT), IEEE; 2019. pp. 853–8.
Al-Hammadi M, Ghulam M, Wadood A, Mansour A, Mohammed AB, Tareq SA, Hassan M, Mohamed AM. Deep learning-based approach for sign language gesture recognition with efficient hand gesture representation. IEEE Access. 2020;8:192527–42.
Köpüklü O, Ahmet G, Neslihan K, Gerhard R. Real-time hand gesture detection and classification using convolutional neural networks. In: 2019 14th IEEE international conference on automatic face & gesture recognition (FG 2019), IEEE; 2019. pp. 1–8.
Armandika F, Esmeralda CD, Fikri N, Fatan K. Dynamic hand gesture recognition using temporal-stream convolutional neural networks. In: 2020 7th international conference on electrical engineering, computer sciences and informatics (EECSI), IEEE; 2020. pp. 132-6.
Li Y, Wang X, Liu W, Feng B. Deep attention network for joint hand gesture localization and recognition using static RGB-D images. Inf Sci. 2018;441:66–78.
Materzynska J, Guillaume B, Ingo B, Roland M. The jester dataset: A large-scale video dataset of human gestures. In: Proceedings of the IEEE/CVF international conference on computer vision workshops; 2019.
Varun KS, Puneeth I, Jacob TP. Hand gesture recognition and implementation for disables using CNN’S. In: 2019 international conference on communication and signal processing (ICCSP), IEEE; 2019. pp. 0592–5.
Al-Hammadi M, Ghulam M, Wadood A, Mansour A, Hossain MS. Hand gesture recognition using 3D-CNN model. IEEE Consum Electron Mag. 2019;9(1):95–101.
Li Y, Wang X, Liu W, Feng B. Pose anchor: a single-stage hand keypoint detection network. IEEE Trans Circ Syst Video Technol. 2019;30(7):2104–13.
Du C, Zhang L, Sun X, Wang J, Sheng J. Enhanced multi-channel feature synthesis for hand gesture recognition based on CNN with a channel and spatial attention mechanism. IEEE Access. 2020;8:144610–20.
Zhou R, Junsong Y, Zhengyou Z. Robust hand gesture recognition based on finger-earth mover’s distance with a commodity depth camera. In: Proceedings of the 19th ACM international conference on Multimedia (MM ’11). Association for computing machinery, New York; 2011. pp. 1093–6.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
All the authors affirm that they do not have any competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Enabling Innovative Computational Intelligence Technologies for IOT” guest edited by Omer Rana, Rajiv Misra, Alexander Pfeiffer, Luigi Troiano and Nishtha Kesswani.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Moyya, P.D., Neelamsetti, K.K. & Thirumoorthy, G. Deep Attention Network for Enhanced Hand Gesture Recognition System. SN COMPUT. SCI. 4, 324 (2023). https://doi.org/10.1007/s42979-022-01590-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-022-01590-3