Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A Fall Detection Network by 2D/3D Spatio-temporal Joint Models with Tensor Compression on Edge

Published: 12 December 2022 Publication History

Abstract

Falling is ranked highly among the threats in elderly healthcare, which promotes the development of automatic fall detection systems with extensive concern. With the fast development of the Internet of Things (IoT) and Artificial Intelligence (AI), camera vision-based solutions have drawn much attention for single-frame prediction and video understanding on fall detection in the elderly by using Convolutional Neural Network (CNN) and 3D-CNN, respectively. However, these methods hardly supervise the intermediate features with good accurate and efficient performance on edge devices, which makes the system difficult to be applied in practice. This work introduces a fast and lightweight video fall detection network based on a spatio-temporal joint-point model to overcome these hurdles. Instead of detecting fall motion by the traditional CNNs, we propose a Long Short-Term Memory (LSTM) model based on time-series joint-point features extracted from a pose extractor. We also introduce the increasingly mature RGB-D camera and propose 3D pose estimation network to further improve the accuracy of the system. We propose to apply tensor train decomposition on the model to reduce storage and computational consumption so the deployment on edge devices can to realized. Experiments are conducted to verify the proposed framework. For fall detection task, the proposed video fall detection framework achieves a high sensitivity of 98.46% on Multiple Cameras Fall, 100% on UR Fall, and 98.01% on NTU RGB-D 120. For pose estimation task, our 2D model attains 73.3 mAP in the COCO keypoint challenge, which outperforms the OpenPose by 8%. Our 3D model attains 78.6% mAP on NTU RGB-D dataset with 3.6× faster speed than OpenPose.

References

[1]
Umar Asif, Benjamin Mashford, Stefan Von Cavallar, Shivanthan Yohanandan, Subhrajit Roy, Jianbin Tang, and Stefan Harrer. 2020. Privacy preserving human fall detection using video data. In Proceedings of the Machine Learning for Health Workshop. 39–51.
[2]
Edouard Auvinet, Caroline Rougier, Jean Meunier, Alain St-Arnaud, and Jacqueline Rousseau. 2010. Multiple Cameras Fall Dataset. Technical report. DIRO-Université de Montréal, Tech. Rep. 1350.
[3]
Smriti Bhandari, Navnee Babar, Pranav Gupta, Nidhi Shah, and Shreyas Pujari. 2017. A novel approach for fall detection in home environment. In Proceedings of the IEEE 6th Global Conference on Consumer Electronics (GCCE). IEEE, 1–5.
[4]
Marc Bosch-Jorge, Antonio-José Sánchez-Salmerón, Ángel Valera, and Carlos Ricolfe-Viala. 2014. Fall detection based on the gravity vector using a wide-angle camera. Exp. Syst. Applic. 41, 17 (2014), 7980–7986.
[5]
Xi Cai, Suyuan Li, Xinyue Liu, and Guang Han. 2020. Vision-based fall detection with multi-task hourglass convolutional auto-encoder. IEEE Access 8 (2020), 44493–44502.
[6]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2018. OpenPose: Realtime multi-person 2D pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018).
[7]
Yie-Tarng Chen, Yu-Ching Lin, and Wen-Hsien Fang. 2010. A hybrid human fall detection scheme. In Proceedings of the IEEE International Conference on Image Processing. IEEE, 3485–3488.
[8]
Yuan Cheng, Guangtai Huang, Peining Zhen, Bin Liu, Hai-Bao Chen, Ngai Wong, and Hao Yu. 2020. An anomaly comprehension neural network for surveillance videos on terminal devices. In Proceedings of the Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1396–1401.
[9]
Yuan Cheng, Guangya Li, Ngai Wong, Hai-Bao Chen, and Hao Yu. 2020. DEEPEYE: A deeply tensor-compressed neural network for video comprehension on terminal devices. ACM Trans. Embed. Comput. Syst. 19, 3 (2020), 1–25.
[10]
Yuan Cheng, Yuchao Yang, Hai-Bao Chen, Ngai Wong, and Hao Yu. 2021. S3-Net: A fast scene understanding network by single-shot segmentation for autonomous driving. ACM Trans. Intell. Syst. Technol. 12, 5 (2021), 1–19.
[11]
Guilhem Chéron, Ivan Laptev, and Cordelia Schmid. 2015. P-CNN: Pose-based CNN features for action recognition. In Proceedings of the IEEE International Conference on Computer Vision. 3218–3226.
[12]
Enea Cippitelli, Francesco Fioranelli, Ennio Gambi, and Susanna Spinsante. 2017. Radar and RGB-depth sensors for fall detection: A review. IEEE Sensors J. 17, 12 (2017), 3585–3604. DOI:
[13]
Qi Feng, Chenqiang Gao, Lan Wang, Yue Zhao, Tiecheng Song, and Qiang Li. 2020. Spatio-temporal fall event detection in complex scenes using attention guided LSTM. Pattern Recog. Lett. 130 (2020), 242–249.
[14]
Homa Foroughi, Baharak Shakeri Aski, and Hamidreza Pourreza. 2008. Intelligent video surveillance for monitoring fall detection of elderly in home environments. In Proceedings of the 11th International Conference on Computer and Information Technology. IEEE, 219–224.
[15]
Jesús Gutiérrez, Víctor Rodríguez, and Sergio Martin. 2021. Comprehensive review of vision-based fall detection systems. Sensors 21, 3 (2021), 947.
[16]
Fouzi Harrou, Nabil Zerrouki, Ying Sun, and Amrane Houacine. 2017. Vision-based fall detection system for improving safety of elderly people. IEEE Instrum. Measur. Mag. 20, 6 (2017), 49–55.
[17]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision. 2961–2969.
[18]
Frank L. Hitchcock. 1927. The expression of a tensor or a polyadic as a sum of products. J. Math. Phys. 6, 1–4 (1927), 164–189.
[19]
Raul Igual, Carlos Medrano, and Inmaculada Plaza. 2013. Challenges, issues and trends in fall detection systems. Biomed. Eng. Onl. 12, 1 (2013), 66.
[20]
Anice Jahanjoo, Marjan Naderan, and Mohammad Javad Rashti. 2020. Detection and multi-class classification of falling in elderly people by deep belief network algorithms. J. Amb. Intell. Human. Comput. (2020), 1–21.
[21]
Muhammed Kocabas, Salih Karagoz, and Emre Akbas. 2018. MultiPoseNet: Fast multi-person pose estimation using pose residual network. In Proceedings of the European Conference on Computer Vision (ECCV). 417–433.
[22]
Bogdan Kwolek and Michal Kepski. 2014. Human fall detection on embedded platform using depth maps and wireless accelerometer. Comput. Meth. Prog. Biomed. 117, 3 (2014), 489–501.
[23]
Qiang Li, John A. Stankovic, Mark A. Hanson, Adam T. Barth, John Lach, and Gang Zhou. 2009. Accurate, fast fall detection using gyroscopes and accelerometer-derived posture information. In Proceedings of the 6th International Workshop on Wearable and Implantable Body Sensor Networks. IEEE, 138–143.
[24]
Ji Lin, Chuang Gan, and Song Han. 2019. TSM: Temporal shift module for efficient video understanding. In Proceedings of the IEEE International Conference on Computer Vision. 7083–7093.
[25]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision. Springer, 740–755.
[26]
Jun Liu, Amir Shahroudy, Mauricio Perez, Gang Wang, Ling-Yu Duan, and Alex C. Kot. 2019. NTU RGB+D 120: A large-scale benchmark for 3D human activity understanding. IEEE Trans. Pattern Anal. Mach. Intell. (2019). DOI:
[27]
Na Lu, Yidan Wu, Li Feng, and Jinbo Song. 2018. Deep learning for fall detection: Three-dimensional CNN combined with LSTM on video kinematic data. IEEE J. Biomed. Health Inform. 23, 1 (2018), 314–323.
[28]
Julieta Martinez, Rayat Hossain, Javier Romero, and James J. Little. 2017. A simple yet effective baseline for 3D human pose estimation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV).
[29]
Muhammad Mubashir, Ling Shao, and Luke Seed. 2013. A survey on fall detection: Principles and approaches. Neurocomputing 100 (2013), 144–152.
[30]
Juan C. Nunez, Raul Cabido, Juan J. Pantrigo, Antonio S. Montemayor, and Jose F. Velez. 2018. Convolutional neural networks and long short-term memory for skeleton-based human activity and hand gesture recognition. Pattern Recog. 76 (2018), 80–94.
[31]
Adrián Núñez-Marcos, Gorka Azkune, and Ignacio Arganda-Carreras. 2017. Vision-based fall detection with convolutional neural networks. Wirel. Commun. Mob. Comput. (2017).
[32]
Tanguy Ophoff, Kristof Van Beeck, and Toon Goedemé. 2019. Exploring RGB+ depth fusion for real-time object detection. Sensors 19, 4 (2019), 866.
[33]
World Health Organization, World Health Organization. Ageing, and Life Course Unit. 2008. WHO Global Report on Falls Prevention in Older Age. World Health Organization.
[34]
I. V. Oseledets. 2011. Tensor-train decomposition. SIAM J. Sci. Comput. 33, 5 (2011), 2295–2317. DOI:
[35]
George Papandreou, Tyler Zhu, Liang-Chieh Chen, Spyros Gidaris, Jonathan Tompson, and Kevin Murphy. 2018. PersonLab: Person pose estimation and instance segmentation with a bottom-up, part-based, geometric embedding model. In Proceedings of the European Conference on Computer Vision (ECCV). 269–286.
[36]
Sijie Song, Cuiling Lan, Junliang Xing, Wenjun Zeng, and Jiaying Liu. 2017. An end-to-end spatio-temporal attention model for human action recognition from skeleton data. In Proceedings of the 31st AAAI Conference on Artificial Intelligence.
[37]
Ledyard R. Tucker. 1966. Some mathematical notes on three-mode factor analysis. Psychometrika 31 (1966), 279–311. DOI:
[38]
V. Vaidehi, Kirupa Ganapathy, K. Mohan, A. Aldrin, and K. Nirmal. 2011. Video based automatic fall detection in indoor environment. In Proceedings of the International Conference on Recent Trends in Information Technology (ICRTIT). IEEE, 1016–1020.
[39]
Kun Wang, Guitao Cao, Dan Meng, Weiting Chen, and Wenming Cao. 2016. Automatic fall detection of human in video using combination of features. In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 1228–1233.
[40]
Shengke Wang, Long Chen, Zixi Zhou, Xin Sun, and Junyu Dong. 2016. Human fall detection in surveillance video based on PCANet. Multim. Tools Applic. 75, 19 (2016), 11603–11613.
[41]
Yuchao Yang, Hongwei Ren, Chenghao Li, Chenchen Ding, and Hao Yu. 2021. An edge-device based fast fall detection using spatio-temporal optical flow model. In Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, 5067–5071.
[42]
Nabil Zerrouki, Fouzi Harrou, Amrane Houacine, and Ying Sun. 2016. Fall detection using supervised machine learning algorithms: A comparative study. In Proceedings of the 8th International Conference on Modelling, Identification and Control (ICMIC). IEEE, 665–670.
[43]
Peining Zhen, Hai-Bao Chen, Yuan Cheng, Zhigang Ji, Bin Liu, and Hao Yu. 2021. Fast video facial expression recognition by a deeply tensor-compressed LSTM neural network for mobile devices. ACM Trans. Internet Things 2, 4 (2021), 1–26.

Cited By

View all
  • (2024)AI-Driven Privacy in Elderly Care: Developing a Comprehensive Solution for Camera-Based Monitoring of Older AdultsApplied Sciences10.3390/app1410415014:10(4150)Online publication date: 14-May-2024
  • (2024)Fall Detection for Elderly People using LiDAR Sensor2024 3rd International Conference on Artificial Intelligence For Internet of Things (AIIoT)10.1109/AIIoT58432.2024.10574642(1-6)Online publication date: 3-May-2024
  • (2024)Deep learning for computer vision based activity recognition and fall detection of the elderly: a systematic reviewApplied Intelligence10.1007/s10489-024-05645-154:19(8982-9007)Online publication date: 8-Jul-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems
ACM Transactions on Embedded Computing Systems  Volume 21, Issue 6
November 2022
498 pages
ISSN:1539-9087
EISSN:1558-3465
DOI:10.1145/3561948
  • Editor:
  • Tulika Mitra
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 12 December 2022
Online AM: 30 April 2022
Accepted: 26 March 2022
Revised: 23 March 2022
Received: 11 July 2021
Published in TECS Volume 21, Issue 6

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Fall detection system on board
  2. spatio-temporal joint-point model
  3. 2D/3D pose estimation
  4. tensorized compression

Qualifiers

  • Research-article
  • Refereed

Funding Sources

  • National Natural Science Foundation of China (NSFC)
  • Shenzhen Science and Technology Program
  • Innovative Team Program of Education Department of Guangdong Province

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)153
  • Downloads (Last 6 weeks)10
Reflects downloads up to 27 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)AI-Driven Privacy in Elderly Care: Developing a Comprehensive Solution for Camera-Based Monitoring of Older AdultsApplied Sciences10.3390/app1410415014:10(4150)Online publication date: 14-May-2024
  • (2024)Fall Detection for Elderly People using LiDAR Sensor2024 3rd International Conference on Artificial Intelligence For Internet of Things (AIIoT)10.1109/AIIoT58432.2024.10574642(1-6)Online publication date: 3-May-2024
  • (2024)Deep learning for computer vision based activity recognition and fall detection of the elderly: a systematic reviewApplied Intelligence10.1007/s10489-024-05645-154:19(8982-9007)Online publication date: 8-Jul-2024
  • (2016)Adaptive Friction CompensationAsian Journal of Control10.1002/asjc.130218:6(2100-2108)Online publication date: 1-Nov-2016

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media