Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3447450.3447473acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvipConference Proceedingsconference-collections
research-article

Research Approach of Hand Gesture Recognition based on Improved YOLOV3 network and Bayes classifier

Published: 09 April 2021 Publication History

Abstract

The study of gesture recognition is closely related to the harmonious development of human-computer interaction, which has important research significance. Aiming at the problems of long detection time and low recognition efficiency of traditional gesture detection algorithm, this paper proposes a gesture recognition model based on the combination of improved the YOLOV3 network and the Bayes classifier. The spatial transformer network is used to improve the YOLOV3 network for processing gesture information and extract key gesture features, so as to solve the problem of data vulnerability while maintaining the depth extraction of feature information. Then, the features are input into the combined model of PCA network and Bayes classifier for predicting gesture categories with reducing the dimension of data and improve the classification accuracy. Finally, the comparison test is performed using the public and self-made dataset, and experimental result illustrate that the propose algorithm can improve detection accuracy with higher effectiveness.

References

[1]
Liu Chunhua. 2017.Research on gesture Recognition Technology based on Wearable Controller. Master's Thesis, Harbin University of Science and Technology.
[2]
Liang Zhijie. 2019. Research on the Key Technology of Deaf-Mute Sign Language recognition. PhD Thesis, Central China Normal University.
[3]
Tian Yi. 2006. Based on data glove hand gesture interaction. Master's Thesis, EECS Department, Zhejiang University.
[4]
Pansare J R, Gawande S H and Ingle M. 2012. Real-Time Static Hand Gesture Recognition for American Sign Language (ASL) in Complex Background. J. Journal of Signal & Information Processing (August 2012).
[5]
Singha J, Roy A, and Laskar R H. 2016. Dynamic hand gesture recognition using vision-based approach for human–computer interaction. J. Neural Computing and Applications.
[6]
Pan Zhigeng, Wang Shunting, and Yao Zhengwei. 2018. Based on the improved convex decomposition of gesture recognition.  J. Journal of Beijing Institute of Technology 38, 3 (March 2006), 279-285.
[7]
Tan Taizhe. Han Yawei, and Shao Yang. 2018. Gesture recognition method based on RGB 3-d images. J. Computer Engineering and Design 39, 2 (February 2018), 511–515.
[8]
Rwigema J, Choi H R, and Kim T Y .2019. A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition. J. Sensors 19, 5.
[9]
Alani A A, Cosma G, and Taherkhani A. 2018.Hand gesture recognition using an adapted convolutional neural network with data augmentation.2018 4th International Conference on Information Management (ICIM). IEEE.
[10]
Liao B, Li J, and Ju Z .2018. Hand Gesture Recognition with Generalized Hough Transform and DC-CNN Using Realsense. Eighth International Conference on Information Science and Technology (ICIST). IEEE.
[11]
Al-Hammadi M, Muhammad G, and Abdul W. 2019.Hand Gesture Recognition Using 3D-CNN Model. J. IEEE Consumer Electronics Magazine 9,1(December 2019),95-101.
[12]
Rubin, B. S, and Sathiesh K V. 2019. Hand Gesture Recognition Using Faster R-CNN Inception V2 Model. AIR 2019: Advances in Robotics.
[13]
Sruthi C J, Lijiya A .2019. Signet: A Deep Learning based Indian Sign Language Recognition System. International Conference on Communication and Signal Processing (ICCSP).
[14]
Lu D, Qiu C, and Xiao Y. 2018. Temporal convolutional neural network for gesture recognition. J. 367-371.
[15]
Kim J H, Hong G S, Kim B G, and Dogra D P. 2018. Deep gesture: deep learning-based gesture recognition scheme using motion sensors. J. Displays 55(December 2018), 38-45.
[16]
Chi D, Jun L, Jun Y, and Hao Y. 2018. A gesture recognition method based on deep learning. J. Control and Information Technology (June 1999).
[17]
Redmon J, and Farhadi A. 2018. Yolov3: an incremental improvement. J. arXiv (April 2018), e-prints.
[18]
LeCun, Y, Bengio, Y and Hinton G. 2015.Deep learning. J. Nature 521(May 2015),436–444. https://doi.org/10.1038/nature14539
[19]
Jaderberg Max, Simonyan Karen, Zisserman Andrew and Kavukcuoglu Koray. 2015. Spatial Transformer Networks. Advances in Neural Information Processing Systems 28 (NIPS 2015).
[20]
Régis Behmo, Marcombes, P, Dalalyan, A, and Véronique Prinet. 2010. Towards Optimal Naive Bayes Nearest Neighbor. European Conference on Computer Vision. Springer, Berlin, Heidelberg.

Cited By

View all
  • (2022)Early Smoke Detection Based on Improved YOLO-PCA NetworkFire10.3390/fire50200405:2(40)Online publication date: 22-Mar-2022
  • (2022)American Sign Language Words Recognition Using Spatio-Temporal Prosodic and Angle Features: A Sequential Learning ApproachIEEE Access10.1109/ACCESS.2022.314813210(15911-15923)Online publication date: 2022
  • (2022)Gesture recognition based on modified Yolov5sIET Image Processing10.1049/ipr2.1247716:8(2124-2132)Online publication date: 18-Mar-2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICVIP '20: Proceedings of the 2020 4th International Conference on Video and Image Processing
December 2020
255 pages
ISBN:9781450389075
DOI:10.1145/3447450
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 April 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Bayes Classification
  2. Deep learning
  3. Gesture Recognition
  4. Spatial Transformer Network

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Natural Science Foundation of Liaoning Province General Project
  • The National Natural Science Foundation of China

Conference

ICVIP 2020

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)2
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Early Smoke Detection Based on Improved YOLO-PCA NetworkFire10.3390/fire50200405:2(40)Online publication date: 22-Mar-2022
  • (2022)American Sign Language Words Recognition Using Spatio-Temporal Prosodic and Angle Features: A Sequential Learning ApproachIEEE Access10.1109/ACCESS.2022.314813210(15911-15923)Online publication date: 2022
  • (2022)Gesture recognition based on modified Yolov5sIET Image Processing10.1049/ipr2.1247716:8(2124-2132)Online publication date: 18-Mar-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media