research-article

Human outline keypoints detecting via global and grouping strategy

Authors:

Yue WuAuthors Info & Claims

HPCCT & BDAI '20: Proceedings of the 2020 4th High Performance Computing and Cluster Technologies Conference & 2020 3rd International Conference on Big Data and Artificial Intelligence

Pages 137 - 144

https://doi.org/10.1145/3409501.3409537

Published: 25 August 2020 Publication History

Abstract

Different from human's pose estimation, the outline keypoints detecting task has not yet long been researched sufficiently in computer vision field. Body's outline cannot be directly recovered with joint keypoints or skeleton only, even with the aid of semantic segmentation. Detecting points of human's outline is still a challenging and relatively new work which aims at describing the outline shape of a human being with ordered keypoints. Moreover, the estimation must be robust with interference, such as self-occlusion or complicated background. By analyzing the characters of the task, we put forward global and grouping strategy. Based on this, we introduce a method to regress 63 keypoints in real-time with outstanding capability even in mobile device. Experimental results show that the proposed model has excellent state-of-the-art performance over traditional pose estimation models.

References

[1]

Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, and Jian Sun. 2017. Cascaded Pyramid Network for Multi-Person Pose Estimation. arXiv: Computer Vision and Pattern Recognition (2017).

[2]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. arXiv: Computer Vision and Pattern Recognition (2015).

[3]

Andrew Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, M Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv: Computer Vision and Pattern Recognition (2017).

[4]

Gao Huang, Shichen Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. 2017. CondenseNet: An Efficient DenseNet using Learned Group Convolutions. arXiv: Computer Vision and Pattern Recognition (2017).

[5]

E. Shelhamer J. Long and T. Darrell. 2015. Fully convolutional networks for semantic segmentation. CVPR (2015).

[6]

Alex Krizhevsky, I. Sutskever, and G. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. Advances in neural information processing systems 25, 2 (2012).

[7]

F. Schroff L.-C. Chen, G. Papandreou and H. Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv (2017).

[8]

Tsungyi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollar, and C Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. (2014), 740--755.

[9]

Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. (2017), 2117--2125.

[10]

Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. arXiv: Computer Vision and Pattern Recognition (2016).

[11]

P. Fischer O. Ronneberger and T. Brox. 2015. U-net: Convolutionalnetworks for biomedical image segmentation. MICCAI (2015).

[12]

Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael S Bernstein, et al. 2015. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision 115, 3 (2015), 211--252.

Digital Library

[13]

Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liangchieh Chen. 2018. MobileNetV2: Inverted Residuals and Linear Bottlenecks. arXiv: Computer Vision and Pattern Recognition (2018).

[14]

Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. international conference on learning representations (2015).

[15]

Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang. 2019. Deep High-Resolution Representation Learning for Human Pose Estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5693--5703. https://academic.microsoft.com/paper/2916798096

[16]

Christian Szegedy,Wei Liu, Yangqing Jia, Pierre Sermanet, Scott E Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. computer vision and pattern recognition (2015), 1--9.

[17]

Jonathan Tompson, Ross Goroshin, Arjun Jain, Yann Lecun, and Christoph Bregler. 2014. Efficient Object Localization Using Convolutional Networks. arXiv: Computer Vision and Pattern Recognition (2014).

[18]

Alexander Toshev and Christian Szegedy. 2014. DeepPose: Human Pose Estimation via Deep Neural Networks. computer vision and pattern recognition (2014) 1653--1660.

[19]

Shih En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2016. Convolutional Pose Machines. (2016).

[20]

JiahongWu, He Zheng, Bo Zhao, Yixin Li, Baoming Yan, Rui Liang,WenjiaWang, Shipei Zhou, Guosen Lin, and Yanwei Fu. [n.d.]. AI Challenger: A Large-scale Dataset for Going Deeper in Image Understanding. ([n. d.]).

[21]

Bin Xiao, Haiping Wu, and Yichen Wei. 2018. Simple Baselines for Human Pose Estimation and Tracking. arXiv: Computer Vision and Pattern Recognition (2018).

Index Terms

Human outline keypoints detecting via global and grouping strategy
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Interest point and salient region detections
      2. Computer vision tasks
        Scene understanding

Recommendations

A local-global coupled-layer puppet model for robust online human pose tracking

We propose a new method for online tracking of articulated human body poses.Our method offers online sequential tracking from one frame to the next.Many other methods mutually optimize poses offline over all frames of a sequence.We propose a novel cross-...
Vision-based human pose estimation for pervasive computing
AMC '09: Proceedings of the 2009 workshop on Ambient media computing

Vision-based human pose estimation is useful in pervasive computing. In this paper, we proposed an example-based approach to human pose estimation from monocular image sequences. We use human motion capture data to synthesize a pose example database ...
Pose determination of human faces by using vanishing points

A new method for estimating 3D-head pose from a monocular image is proposed in this paper. The approach employs general prior knowledge of face structure and the corresponding geometrical constraints provided by the location of vanishing point to ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

HPCCT & BDAI '20: Proceedings of the 2020 4th High Performance Computing and Cluster Technologies Conference & 2020 3rd International Conference on Big Data and Artificial Intelligence

July 2020

276 pages

ISBN:9781450375603

DOI:10.1145/3409501

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Xi'an Jiaotong-Liverpool University: Xi'an Jiaotong-Liverpool University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 August 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

HPCCT & BDAI 2020

HPCCT & BDAI 2020: 2020 4th High Performance Computing and Cluster Technologies Conference & 2020 3rd International Conference on Big Data and Artificial Intelligence

July 3 - 6, 2020

Qingdao, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
28
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 29 Sep 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents