Abstract
The recognition of human gestures and actions in images and videos is an active area of research in computer vision. This field has made great advances in the last decade thanks to the use of deep learning techniques. In addition, the recent diffusion of low-cost video camera systems, including depth cameras has enhanced the development of observation systems in a variety of application areas, such as video surveillance, home security, healthcare, etc. However, most of these developments are in closed and controlled environments. The recognition of movements and gestures in real-time through a camera that acquires its images in an uncontrolled environment (such as a shopping mall, a university lobby or a museum hall) and that allows interaction with passers-by in these spaces, contains challenges in various areas that include at least technological, social and legal challenges that need a careful approach. Within this framework, we set as objectives of this project the design of a web interface adapted to interaction without physical contact (i.e., through video images captured with a camera in real-time) and the construction of artificial intelligence models (based on deep learning) that guarantee, in an uncontrolled environment, an interaction with this web interface. As a first step in this work, we propose a literature review in this area. In addition, we include model recognition for a given set of gestures to explore the possibilities of different approaches.
This work was partially supported by Doctorado industrial, Comunidad Autónoma de La Rioja, and Grant PID2020-115225RB-I00 funded by MCIN/AEI/ 10.13039/501100011033.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ahad, M.A.R., Mahbub, U., Rahman, T.: Contactless Human Activity Analysis, vol. 200. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68590-4
Casado-García, Á., et al.: CLoDSA: a tool for augmentation in classification, localization, detection, semantic segmentation and instance segmentation tasks. BMC Bioinform. 20(1), 1–14 (2019)
Duarte, A., et al.: How2Sign: a large-scale multimodal dataset for continuous American sign language. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2735–2744 (2021)
Gatys, L.A., Ecker, A.S., Bethge, M.: A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015)
Jegham, I., Khalifa, A.B., Alouani, I., Mahjoub, M.A.: Vision-based human action recognition: an overview and real world challenges. Forensic Sci. Int. Digit. Investig. 32, 200901 (2020)
Liu, J., Akhtar, N., Mian, A.: Skepxels: spatio-temporal image representation of human skeleton joints for action recognition. In: CVPR Workshops, pp. 10–19 (2019)
Mahmud, T., Hasan, M.: Vision-based human activity recognition. Contactless Hum. Act. Anal. 1–42 (2021)
O’Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., Invernizzi, L., et al.: Kerastuner (2019). https://github.com/keras-team/keras-tuner
Özyer, T., Ak, D.S., Alhajj, R.: Human action recognition approaches with video datasets-a survey. Knowl.-Based Syst. 222, 106995 (2021)
Poulinakis, K.: Complete Practical Tutorial on Keras Tuner GitHub. https://github.com/Poulinakis-Konstantinos/Blogging-Journey/blob/main/Keras-Tuner-Complete-Tutorial/keras-tuner.ipynb. Accessed 25 Apr 2023
Sánchez-Caballero, A., Fuentes-Jiménez, D., Losada-Gutiérrez, C.: Real-time human action recognition using raw depth video-based recurrent neural networks. Multimed. Tools Appl. 1–23 (2022)
Sarkar, A., Banerjee, A., Singh, P.K., Sarkar, R.: 3D human action recognition: through the eyes of researchers. Expert Syst. Appl. 116424 (2022)
Sharif Razavian, A., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 806–813 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alvear, V., Domínguez, C., Mata, G. (2023). Human Action Recognition in Uncontrolled Environments: Application from Artificial Intelligence to Contactless Interfaces. In: Novais, P., et al. Ambient Intelligence – Software and Applications – 14th International Symposium on Ambient Intelligence. ISAmI 2023. Lecture Notes in Networks and Systems, vol 770. Springer, Cham. https://doi.org/10.1007/978-3-031-43461-7_26
Download citation
DOI: https://doi.org/10.1007/978-3-031-43461-7_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43460-0
Online ISBN: 978-3-031-43461-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)