Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3409501.3409537acmotherconferencesArticle/Chapter ViewAbstractPublication PageshpcctConference Proceedingsconference-collections
research-article

Human outline keypoints detecting via global and grouping strategy

Published: 25 August 2020 Publication History

Abstract

Different from human's pose estimation, the outline keypoints detecting task has not yet long been researched sufficiently in computer vision field. Body's outline cannot be directly recovered with joint keypoints or skeleton only, even with the aid of semantic segmentation. Detecting points of human's outline is still a challenging and relatively new work which aims at describing the outline shape of a human being with ordered keypoints. Moreover, the estimation must be robust with interference, such as self-occlusion or complicated background. By analyzing the characters of the task, we put forward global and grouping strategy. Based on this, we introduce a method to regress 63 keypoints in real-time with outstanding capability even in mobile device. Experimental results show that the proposed model has excellent state-of-the-art performance over traditional pose estimation models.

References

[1]
Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, and Jian Sun. 2017. Cascaded Pyramid Network for Multi-Person Pose Estimation. arXiv: Computer Vision and Pattern Recognition (2017).
[2]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. arXiv: Computer Vision and Pattern Recognition (2015).
[3]
Andrew Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, M Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv: Computer Vision and Pattern Recognition (2017).
[4]
Gao Huang, Shichen Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. 2017. CondenseNet: An Efficient DenseNet using Learned Group Convolutions. arXiv: Computer Vision and Pattern Recognition (2017).
[5]
E. Shelhamer J. Long and T. Darrell. 2015. Fully convolutional networks for semantic segmentation. CVPR (2015).
[6]
Alex Krizhevsky, I. Sutskever, and G. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. Advances in neural information processing systems 25, 2 (2012).
[7]
F. Schroff L.-C. Chen, G. Papandreou and H. Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv (2017).
[8]
Tsungyi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollar, and C Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. (2014), 740--755.
[9]
Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. (2017), 2117--2125.
[10]
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. arXiv: Computer Vision and Pattern Recognition (2016).
[11]
P. Fischer O. Ronneberger and T. Brox. 2015. U-net: Convolutionalnetworks for biomedical image segmentation. MICCAI (2015).
[12]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael S Bernstein, et al. 2015. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision 115, 3 (2015), 211--252.
[13]
Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liangchieh Chen. 2018. MobileNetV2: Inverted Residuals and Linear Bottlenecks. arXiv: Computer Vision and Pattern Recognition (2018).
[14]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. international conference on learning representations (2015).
[15]
Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang. 2019. Deep High-Resolution Representation Learning for Human Pose Estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5693--5703. https://academic.microsoft.com/paper/2916798096
[16]
Christian Szegedy,Wei Liu, Yangqing Jia, Pierre Sermanet, Scott E Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. computer vision and pattern recognition (2015), 1--9.
[17]
Jonathan Tompson, Ross Goroshin, Arjun Jain, Yann Lecun, and Christoph Bregler. 2014. Efficient Object Localization Using Convolutional Networks. arXiv: Computer Vision and Pattern Recognition (2014).
[18]
Alexander Toshev and Christian Szegedy. 2014. DeepPose: Human Pose Estimation via Deep Neural Networks. computer vision and pattern recognition (2014) 1653--1660.
[19]
Shih En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2016. Convolutional Pose Machines. (2016).
[20]
JiahongWu, He Zheng, Bo Zhao, Yixin Li, Baoming Yan, Rui Liang,WenjiaWang, Shipei Zhou, Guosen Lin, and Yanwei Fu. [n.d.]. AI Challenger: A Large-scale Dataset for Going Deeper in Image Understanding. ([n. d.]).
[21]
Bin Xiao, Haiping Wu, and Yichen Wei. 2018. Simple Baselines for Human Pose Estimation and Tracking. arXiv: Computer Vision and Pattern Recognition (2018).

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
HPCCT & BDAI '20: Proceedings of the 2020 4th High Performance Computing and Cluster Technologies Conference & 2020 3rd International Conference on Big Data and Artificial Intelligence
July 2020
276 pages
ISBN:9781450375603
DOI:10.1145/3409501
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Xi'an Jiaotong-Liverpool University: Xi'an Jiaotong-Liverpool University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Pose estimation
  2. human outline keypoints detection
  3. rigid restriction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HPCCT & BDAI 2020

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 28
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media