Human Action Recognition in Uncontrolled Environments: Application from Artificial Intelligence to Contactless Interfaces

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 770))

Included in the following conference series:

International Symposium on Ambient Intelligence

260 Accesses

Abstract

The recognition of human gestures and actions in images and videos is an active area of research in computer vision. This field has made great advances in the last decade thanks to the use of deep learning techniques. In addition, the recent diffusion of low-cost video camera systems, including depth cameras has enhanced the development of observation systems in a variety of application areas, such as video surveillance, home security, healthcare, etc. However, most of these developments are in closed and controlled environments. The recognition of movements and gestures in real-time through a camera that acquires its images in an uncontrolled environment (such as a shopping mall, a university lobby or a museum hall) and that allows interaction with passers-by in these spaces, contains challenges in various areas that include at least technological, social and legal challenges that need a careful approach. Within this framework, we set as objectives of this project the design of a web interface adapted to interaction without physical contact (i.e., through video images captured with a camera in real-time) and the construction of artificial intelligence models (based on deep learning) that guarantee, in an uncontrolled environment, an interaction with this web interface. As a first step in this work, we propose a literature review in this area. In addition, we include model recognition for a given set of gestures to explore the possibilities of different approaches.

This work was partially supported by Doctorado industrial, Comunidad Autónoma de La Rioja, and Grant PID2020-115225RB-I00 funded by MCIN/AEI/ 10.13039/501100011033.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Human Object Interaction: A Survey on Models and Their Key Challenges and Potential Applications in Future Fields

Contactless Human-Computer Interaction Using a Deep Neural Network Pipeline for Real-Time Video Interpretation and Classification

Deep Learning for Assistive Computer Vision

References

Ahad, M.A.R., Mahbub, U., Rahman, T.: Contactless Human Activity Analysis, vol. 200. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68590-4
Casado-García, Á., et al.: CLoDSA: a tool for augmentation in classification, localization, detection, semantic segmentation and instance segmentation tasks. BMC Bioinform. 20(1), 1–14 (2019)
Article Google Scholar
Duarte, A., et al.: How2Sign: a large-scale multimodal dataset for continuous American sign language. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2735–2744 (2021)
Google Scholar
Gatys, L.A., Ecker, A.S., Bethge, M.: A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015)
Jegham, I., Khalifa, A.B., Alouani, I., Mahjoub, M.A.: Vision-based human action recognition: an overview and real world challenges. Forensic Sci. Int. Digit. Investig. 32, 200901 (2020)
Article Google Scholar
Liu, J., Akhtar, N., Mian, A.: Skepxels: spatio-temporal image representation of human skeleton joints for action recognition. In: CVPR Workshops, pp. 10–19 (2019)
Google Scholar
Mahmud, T., Hasan, M.: Vision-based human activity recognition. Contactless Hum. Act. Anal. 1–42 (2021)
Google Scholar
O’Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., Invernizzi, L., et al.: Kerastuner (2019). https://github.com/keras-team/keras-tuner
Özyer, T., Ak, D.S., Alhajj, R.: Human action recognition approaches with video datasets-a survey. Knowl.-Based Syst. 222, 106995 (2021)
Article Google Scholar
Poulinakis, K.: Complete Practical Tutorial on Keras Tuner GitHub. https://github.com/Poulinakis-Konstantinos/Blogging-Journey/blob/main/Keras-Tuner-Complete-Tutorial/keras-tuner.ipynb. Accessed 25 Apr 2023
Sánchez-Caballero, A., Fuentes-Jiménez, D., Losada-Gutiérrez, C.: Real-time human action recognition using raw depth video-based recurrent neural networks. Multimed. Tools Appl. 1–23 (2022)
Google Scholar
Sarkar, A., Banerjee, A., Singh, P.K., Sarkar, R.: 3D human action recognition: through the eyes of researchers. Expert Syst. Appl. 116424 (2022)
Google Scholar
Sharif Razavian, A., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 806–813 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Matemáticas y Computación, Universidad de La Rioja, Logroño, Spain
Vanessa Alvear, César Domínguez & Gadea Mata
Innovación Riojana de Soluciones IT SL, Logroño, La Rioja, Spain
Vanessa Alvear

Authors

Vanessa Alvear
View author publications
You can also search for this author in PubMed Google Scholar
César Domínguez
View author publications
You can also search for this author in PubMed Google Scholar
Gadea Mata
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gadea Mata .

Editor information

Editors and Affiliations

University of Minho, Braga, Portugal
Paulo Novais
Universitat Politècnica de València, Valencia, Valencia, Spain
Vicente Julián Inglada
University of Granada, Granada, Spain
Miguel J. Hornos
National Institute of Informatics, Chiyoda, Japan
Ichiro Satoh
CIICESI, ESTG, Politécnico do Porto, Felgueiras, Portugal
Davide Carneiro
ISEP/GECAD, Porto, Portugal
João Carneiro
Deep tech lab, AIR Institute, Valladolid, Spain
Ricardo S. Alonso

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alvear, V., Domínguez, C., Mata, G. (2023). Human Action Recognition in Uncontrolled Environments: Application from Artificial Intelligence to Contactless Interfaces. In: Novais, P., et al. Ambient Intelligence – Software and Applications – 14th International Symposium on Ambient Intelligence. ISAmI 2023. Lecture Notes in Networks and Systems, vol 770. Springer, Cham. https://doi.org/10.1007/978-3-031-43461-7_26

Download citation

DOI: https://doi.org/10.1007/978-3-031-43461-7_26
Published: 26 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43460-0
Online ISBN: 978-3-031-43461-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics