Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3652628.3652701acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicaiceConference Proceedingsconference-collections
research-article

A Global-Part-Local Approach for 3D Human Pose Estimation from Single-View Images

Published: 23 May 2024 Publication History

Abstract

Significant progress has been made in 3D human pose estimation (HPE) from monocular images. Previous research has shown that in the process of 3D HPE, global information provides an overall spatial structure and a rough layout of the pose. In contrast, local information plays a crucial role in precisely locating specific body parts. The combined consideration of both aspects proves advantageous for effectively acquiring a comprehensive representation of the human skeleton. However, the human body possesses an intrinsic topological structure, which gives rise to pose estimation errors. These errors have a tendency to propagate within the body's interconnected framework, causing them to accumulate predominantly at the distal joints. This accumulation, over time, culminates in a discernible reduction in the overall accuracy of pose estimation. To address this issue, we propose a method termed GlPaLo (Global-Part-Local), which integrates global, part-level, and local information. It consists of two key modules: uMLPGraph algorithm and BPConstraint module. GlPaLo aims to capture global, part, and local information among human keypoints to improve the accuracy of 3D HPE. Our uMLPGraph algorithm module consists of a multi-layer perceptron with a U-shaped structure (uMLP) and a graph convolutional network (GCN), which is used to simultaneously process both global and local information. The BPConstraint module is divided into body-level constraints and part-level constraints, aiming to learn constraint information about the human body structure. At the part-level constraints, introducing parent node features as prior knowledge helps to reduce the accumulation of errors at the end joints of the human body. Extensive experiments conducted on the Human3.6M and MPI-INF-3DHP datasets demonstrate that the proposed method achieves outstanding performance.

References

[1]
Liu J, Zhang L. Indoor privacy-preserving action recognition via partially coupled convolutional neural network[C]//2020 International Conference on Artificial Intelligence and Computer Engineering (ICAICE). IEEE, 2020: 292-295.
[2]
Li Y, Liu P, Xiang R, Research Progress of Monitoring System Based on Wearable Fall Detection Equipment for The Elderly[C]//2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE). IEEE, 2021: 6-10.
[3]
Teng Z, Xu Z, Zhao Y. A human-machine interaction method and process based on the intelligent medical robot[C]//Third International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022). SPIE, 2023, 12610: 156-163.
[4]
Xing B, Li H, Chen N. Research on aerobics action pose recognition based on deep learning[C]//Third International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022). SPIE, 2023, 12610: 801-806.
[5]
Zhang K, Zhang Y. SmartPose: An Intelligent Household Appliances Controller Based on Visual Recognition of Human Postures[C]//2020 International Conference on Artificial Intelligence and Computer Engineering (ICAICE). IEEE, 2020: 218-223.
[6]
Zou Z, Tang W. Modulated graph convolutional network for 3D human pose estimation[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2021: 11477-11487.
[7]
Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, and Dimitris N Metaxas. Semantic graph convolutional networks for 3d human pose regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3425–3435, 2019.
[8]
Tianhan Xu and Wataru Takano. Graph stacked hourglass networks for 3d human pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16105–16114, 2021.
[9]
K. Liu, R. Ding, Z. Zou, L. Wang, and W. Tang, “A comprehensive study of weight sharing in graph networks for 3D human pose estimation,” in ECCV, 2020, pp. 318–334
[10]
Li, Wenhao, "Mhformer: Multi-hypothesis transformer for 3d human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.
[11]
I. O. Tolstikhin, N. Houlsby, A. Kolesnikov, L. Beyer, X. Zhai, T. Unterthiner, J. Yung, A. Steiner, D. Keysers, J. Uszkoreit, “MLP-Mixer: An all-MLP architecture for vision,” vol. 34, 2021, pp. 24 261–24 272.
[12]
Vaswani, Ashish, "Attention is all you need." Advances in neural information processing systems 30. 2017.
[13]
Nair, Vinod, and Geoffrey E. Hinton. "Rectified linear units improve restricted boltzmann machines." Proceedings of the 27th international conference on machine learning (ICML-10). 2010.
[14]
Ionescu, Catalin, "Human3. 6m: Large scale datasets and predictive methods for 3d human sensing in natural environments." IEEE transactions on pattern analysis and machine intelligence 36.7. 2013: 1325-1339.
[15]
Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, and Christian Theobalt, “Monocular 3D human pose estimation in the wild using improved CNN supervision,” in 3DV, 2017, pp. 506–516.
[16]
Chen, Yilun, "Cascaded pyramid network for multi-person pose estimation." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.
[17]
Julieta Martinez, Rayat Hossain, Javier Romero, and James J Little. A simple yet effective baseline for 3d human pose estimation. In Proceedings of the IEEE International Conference on Computer Vision, pages 2640–2649, 2017.
[18]
Hao-Shu Fang, Yuanlu Xu, Wenguan Wang, Xiaobai Liu, and Song-Chun Zhu. Learning pose grammar to encode human body configuration for 3d pose estimation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, 2018.
[19]
W. Yang, W. Ouyang, X. Wang, J. Ren, H. Li, and X. Wang, “3D human pose estimation in the wild by adversarial learning,” in CVPR, 2018, pp. 5255–5264.
[20]
Sharma, Saurabh, "Monocular 3d human pose estimation by generation and ordinal ranking." Proceedings of the IEEE/CVF international conference on computer vision. 2019.
[21]
Georgios Pavlakos, Xiaowei Zhou, and Kostas Daniilidis. Ordinal depth supervision for 3d human pose estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 7307–7316, 2018.
[22]
Hai Ci, Chunyu Wang, Xiaoxuan Ma, and Yizhou Wang. Optimizing network structure for 3d human pose estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 2262–2271, 2019.
[23]
Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, and Stephen Lin, “SRNet: Improving generalization in 3D human pose estimation with a split-and-recombine approach,” in ECCV, 2020, pp. 507–523.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICAICE '23: Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering
November 2023
1263 pages
ISBN:9798400708831
DOI:10.1145/3652628
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 May 2024

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICAICE 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 8
    Total Downloads
  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)4
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media