Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3191442.3191452acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicigpConference Proceedingsconference-collections
research-article

Utilization of Color-depth Combination Features and Multi-level Refinement CNN for Upper-limb Posture Recognition

Published: 24 February 2018 Publication History

Abstract

Upper-limb posture recognition is of great value to rehabilitation and assessment of stroke patients. In this paper, we propose a novel method for upper-limb posture recognition. Convolutional neural network (CNN) cascade is applied to reduce the training difficulty of the algorithm. Information of depth and color is combined to eliminate the influence of complex background and illumination variation. Kinect is used to automatically acquire a large number of upper limb posture labels. The principle of coarse-to-fine runs through the whole algorithm. The overall network architecture consists of 3 levels with a total of six CNNs. First of all, color and depth images are aligned to obtain a RGB-D quad channels image. The quad-channel image is sent to level-1 cascade network to obtain a bounding box containing the upper limb cropped from the entire body. Then, the resulting bounding box is brought into level-2 cascade network and 4 sets of rough upper-limb joints coordinates are obtained. Finally, zoom in the visual field to local area of 4 key points, 4 sets of accurate coordinate is obtained by level-3 cascade network. Experimental results show that upper-limb posture is calculated by the proposed algorithm that has strong stability to both illumination and background problems.

References

[1]
Shotton, J., Blake, A. and Cipolla, R. 2005. Contour-based learning for object detection. Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 (Oct. 2005), 503--510 Vol. 1.
[2]
Bourdev, L., Maji, S., Brox, T. and Malik, J. 2010. Detecting People Using Mutually Consistent Poselet Activations. Computer Vision -- ECCV 2010 (Sep. 2010), 168--181.
[3]
Rogez, G., Rihan, J., Ramalingam, S., Orrite, C. and Torr, P.H.S. 2008. Randomized trees for human pose detection. 2008 IEEE Conference on Computer Vision and Pattern Recognition (Jun. 2008), 1--8.
[4]
Baak, A., Müller, M., Bharaj, G., Seidel, H.-P. and Theobalt, C. 2013. A Data-Driven Approach for Real-Time Full Body Pose Reconstruction from a Depth Camera. Consumer Depth Cameras for Computer Vision. Springer, London. 71--98.
[5]
Grest, D., Woetzel, J. and Koch, R. 2005. Nonlinear body pose estimation from depth images. DAGM-Symposium (2005), 285--292.
[6]
Knoop, S., Vacek, S. and Dillmann, R. 2009. Fusion of 2D and 3D sensor data for articulated body tracking. Robotics and Autonomous Systems. 57, 3 (2009), 321--329.
[7]
Grest, D., Krüger, V. and Koch, R. 2007. Single view motion tracking by depth and silhouette information. Image Analysis. (2007), 719--729.
[8]
Schwarz, L.A., Mateus, D., Castañeda, V. and Navab, N. 2010. Manifold learning for tof-based human body tracking and activity recognition. BMVC (2010), 1--11.
[9]
Schwarz, L.A., Mateus, D. and Navab, N. 2012. Recognizing multiple human activities and tracking full-body pose in unconstrained environments. Pattern Recognition. 45, 1 (2012), 11--23.
[10]
Shotton, J., Sharp, T., Kipman, A., Fitzgibbon, A., Finocchio, M., Blake, A., Cook, M. and Moore, R. 2013. Real-time human pose recognition in parts from single depth images. Communications of the ACM. 56, 1 (2013), 116--124.
[11]
Sun, Y., Wang, X. and Tang, X. 2013. Deep convolutional network cascade for facial point detection. Proceedings of the IEEE conference on computer vision and pattern recognition (2013), 3476--3483.
[12]
Yang, L., Zhang, L., Dong, H., Alelaiwi, A. and El Saddik, A. 2015. Evaluating and improving the depth accuracy of Kinect for Windows v2. IEEE Sensors Journal. 15, 8 (2015), 4275--4285.
[13]
Zennaro, S., Munaro, M., Milani, S., Zanuttigh, P., Bernardi, A., Ghidoni, S. and Menegatti, E. 2015. Performance evaluation of the 1st and 2nd generation Kinect for multimedia applications. Multimedia and Expo (ICME), 2015 IEEE International Conference on (2015), 1--6.

Cited By

View all
  • (2021)Artificial Vision Algorithms for Socially Assistive Robot Applications: A Review of the LiteratureSensors10.3390/s2117572821:17(5728)Online publication date: 25-Aug-2021
  • (2020)Convolutional neural network in upper limb functional motion analysis after strokePeerJ10.7717/peerj.101248(e10124)Online publication date: 9-Oct-2020

Index Terms

  1. Utilization of Color-depth Combination Features and Multi-level Refinement CNN for Upper-limb Posture Recognition

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICIGP '18: Proceedings of the 2018 International Conference on Image and Graphics Processing
      February 2018
      183 pages
      ISBN:9781450363679
      DOI:10.1145/3191442
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      In-Cooperation

      • Wuhan Univ.: Wuhan University, China

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 February 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. CNN cascade
      2. RGB-D fusion
      3. posture recognition
      4. upper-limb

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      ICIGP 2018

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)Artificial Vision Algorithms for Socially Assistive Robot Applications: A Review of the LiteratureSensors10.3390/s2117572821:17(5728)Online publication date: 25-Aug-2021
      • (2020)Convolutional neural network in upper limb functional motion analysis after strokePeerJ10.7717/peerj.101248(e10124)Online publication date: 9-Oct-2020

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media