research-article

Vehicle Pose Estimation Based on Convolutional Neural Network

Authors:

Yao HeAuthors Info & Claims

APIT '22: Proceedings of the 2022 4th Asia Pacific Information Technology Conference

Pages 81 - 87

https://doi.org/10.1145/3512353.3512365

Published: 14 March 2022 Publication History

Abstract

To address the problems of high computational cost for vehicle pose estimation methods based on binocular vision and lidar, a monocular vision pose estimation algorithm is proposed, and a multi-task convolutional neural network is designed as a network for vehicle position and pose estimation. Regard the target vehicle as a rigid body, model its size, attitude angle, and 3D center point separately, we train the model on the ApolloSpace dataset, and analyze the trained network to estimate the calculation speed and accuracy of the model. The results show that the pose estimation network presents superior accuracy and detection speed compared to two strong baseline models.

References

[1]

Claudine Badue, Rânik Guidolini, Raphael Vivacqua Carneiro, Pedro Azevedo, Vinicius Brito Cardoso, Avelino Forechi, Luan Jesus, Rodrigo Berriel, Thiago Meireles Paixao, Filipe Mutz, 2020. Self-driving cars: A survey. Expert Systems with Applications(2020), 113816.

[2]

Kunhai Cai, Yanling Tian, Fujun Wang, Dawei Zhang, Xianping Liu, and Bijan Shirinzadeh. 2017. Design and control of a 6-degree-of-freedom precision positioning system. Robotics and Computer-Integrated Manufacturing 44 (2017), 77–96.

Digital Library

[3]

Arindam Chaudhuri, Krupa Mandaviya, Pratixa Badelia, and Soumya K Ghosh. 2017. Optical character recognition systems. In Optical Character Recognition Systems for Different Languages with Soft Computing. Springer, 9–41.

[4]

Yu-Chun Chen, Te-Feng Su, and Shang-Hong Lai. 2014. Integrated vehicle and lane detection with distance estimation. In Asian Conference on Computer Vision. Springer, 473–485.

[5]

Dawei Du, Yuankai Qi, Hongyang Yu, Yifan Yang, Kaiwen Duan, Guorong Li, Weigang Zhang, Qingming Huang, and Qi Tian. 2018. The unmanned aerial vehicle benchmark: Object detection and tracking. In Proceedings of the European Conference on Computer Vision (ECCV). 370–386.

Digital Library

[6]

Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, and Qi Tian. 2019. Centernet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 6569–6578.

[7]

Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for autonomous driving? the kitti vision benchmark suite. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3354–3361.

[8]

JW Hong. 2020. Why is artificial intelligence blamed more? Analysis of faulting artificial intelligence for self-driving car accidents in experimental settings. International Journal of Human–Computer Interaction 36, 18(2020), 1768–1774.

[9]

Xinyu Huang, Xinjing Cheng, Qichuan Geng, Binbin Cao, Dingfu Zhou, Peng Wang, Yuanqing Lin, and Ruigang Yang. 2018. The apolloscape dataset for autonomous driving. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 954–960.

[10]

Jason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, and Steven L Waslander. 2018. Joint 3d proposal generation and object detection from view aggregation. In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 1–8.

Digital Library

[11]

Jason Ku, Alex D Pon, Sean Walsh, and Steven L Waslander. 2019. Improving 3d object detection for pedestrians with virtual multi-view synthesis orientation estimation. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 3459–3466.

Digital Library

[12]

Peiliang Li, Xiaozhi Chen, and Shaojie Shen. 2019. Stereo r-cnn based 3d object detection for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7644–7652.

[13]

Daniel Onoro-Rubio and Roberto J López-Sastre. 2016. Towards perspective-free object counting with deep learning. In European conference on computer vision. Springer, 615–629.

[14]

Ki-Yeong Park and Sun-Young Hwang. 2014. Robust range estimation with a monocular camera for vision-based forward collision warning system. The Scientific World Journal 2014 (2014).

[15]

Francisco Pérez-Hernández, Siham Tabik, Alberto Lamas, Roberto Olmos, Hamido Fujita, and Francisco Herrera. 2020. Object detection binary classifiers methodology based on deep learning to identify small objects handled similarly: Application in video surveillance. Knowledge-Based Systems 194 (2020), 105590.

[16]

Tong Qin, Peiliang Li, and Shaojie Shen. 2018. Vins-mono: A robust and versatile monocular visual-inertial state estimator. IEEE Transactions on Robotics 34, 4 (2018), 1004–1020.

Digital Library

[17]

Zengyi Qin, Jinglu Wang, and Yan Lu. 2019. Triangulation learning network: from monocular to stereo 3d object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7615–7623.

[18]

Thomas Roddick, Alex Kendall, and Roberto Cipolla. 2018. Orthographic feature transform for monocular 3d object detection. arXiv preprint arXiv:1811.08188(2018).

[19]

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234–241.

[20]

Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, and Ruigang Yang. 2019. Apollocar3d: A large 3d car instance understanding benchmark for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5452–5462.

[21]

Xudong Sun, Pengcheng Wu, and Steven CH Hoi. 2018. Face detection using deep learning: An improved faster RCNN approach. Neurocomputing 299(2018), 42–50.

[22]

Mingxing Tan and Quoc Le. 2019. Efficientnet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning. PMLR, 6105–6114.

[23]

Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV). 3–19.

Digital Library

[24]

Zhengyou Zhang. 2000. A flexible new technique for camera calibration. IEEE Transactions on pattern analysis and machine intelligence 22, 11(2000), 1330–1334.

Digital Library

Recommendations

Camera Pose Estimation Method Based on Deep Neural Network
ICDLT '19: Proceedings of the 2019 3rd International Conference on Deep Learning Technologies

This paper proposes a camera pose estimation algorithm based on deep neural network, which returns the translation and rotation of the camera based on supervised deep learning. This paper uses the ORB algorithm to extract the feature points of the image,...
Scene text recognition using residual convolutional recurrent neural network

Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The ...
Deep probabilistic human pose estimation

The authors consider the problem of human pose estimation using probabilistic convolutional neural networks. They explore ways to improve human pose estimation accuracy on standard pose estimation benchmarks MPII human pose and Leeds Sports Pose (LSP) ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

APIT '22: Proceedings of the 2022 4th Asia Pacific Information Technology Conference

January 2022

239 pages

ISBN:9781450395571

DOI:10.1145/3512353

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 March 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

APIT 2022

APIT 2022: 2022 4th Asia Pacific Information Technology Conference

January 14 - 16, 2022

Virtual Event, Thailand

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
82
Total Downloads

Downloads (Last 12 months)10
Downloads (Last 6 weeks)1

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten