Study on Multi-Pedestrian Trajectory Tracking based on improved YOLOv5+DeepSort

Authors: Chuanying Yang, Shaoying Ma

ICCVIT '23: Proceedings of the 2023 International Conference on Computer, Vision and Intelligent Technology

Article No.: 50, Pages 1 - 10

https://doi.org/10.1145/3627341.3630406

Published: 15 December 2023

Abstract

A multi-object tracking algorithm based on an improved YOLOv5+DeepSort pipeline is proposed to improve tracking performance in crowded and blurry scenes. The algorithm makes three improvements. First, the SKNet visual attention mechanism is integrated into the YOLOv5 backbone to strengthen recognition of blurry, crowded groups. Second, the FPN+PAN structure of the YOLOv5 feature fusion module is replaced with a BiFPN structure to achieve efficient bidirectional cross-scale connections and weighted feature fusion. Finally, to improve DeepSort tracking performance, the constant-velocity model in the Kalman filter is replaced with a constant-acceleration model to better describe pedestrian motion, and a second matching stage based on DIoU is applied to detection boxes that were not matched successfully. Experimental results show that accuracy improves by 5.20% and precision by 1.85% on MOT17, and that accuracy improves by 4.09% and precision by 1.33% on MOT20.
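The two tracking-side changes named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the code assumes the standard DIoU definition (IoU minus the squared center distance divided by the squared diagonal of the smallest enclosing box) and a one-dimensional constant-acceleration state `(position, velocity, acceleration)`. The function names `diou` and `predict_constant_acceleration` are hypothetical.

```python
def diou(box_a, box_b):
    """Distance-IoU of two boxes given as (x1, y1, x2, y2).

    Assumes the standard definition: IoU - d^2 / c^2, where d is the
    distance between box centers and c is the diagonal of the smallest
    box enclosing both.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distance between the two box centers.
    d2 = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
        + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest enclosing box.
    c2 = (max(ax2, bx2) - min(ax1, bx1)) ** 2 \
        + (max(ay2, by2) - min(ay1, by1)) ** 2

    return iou - d2 / c2 if c2 > 0 else iou


def predict_constant_acceleration(state, dt=1.0):
    """Kalman-style mean prediction for one coordinate.

    state is (position, velocity, acceleration); acceleration is held
    constant over the time step dt (an illustrative 1-D model).
    """
    p, v, a = state
    return (p + v * dt + 0.5 * a * dt * dt, v + a * dt, a)
```

Unlike plain IoU, DIoU still ranks non-overlapping boxes by how far apart they are, which is one plausible reason to use it when re-matching detections that the first association stage left unmatched. A matching cost could be taken as `1 - diou(track_box, detection_box)`, though the exact cost used in the paper is not stated in the abstract.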

Index Terms

Study on Multi-Pedestrian Trajectory Tracking based on improved YOLOv5+DeepSort
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Tracking
      2. Computer vision tasks
        Scene understanding
  2. Machine learning

Index terms have been assigned to the content through auto-classification.

Information & Contributors

Information

Published In

ICCVIT '23: Proceedings of the 2023 International Conference on Computer, Vision and Intelligent Technology

August 2023

378 pages

ISBN:9798400708701

DOI:10.1145/3627341

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICCVIT 2023

ICCVIT 2023: International Conference on Computer, Vision and Intelligent Technology

August 25 - 28, 2023

Chenzhou, China

Acceptance Rates

ICCVIT '23 Paper Acceptance Rate 54 of 142 submissions, 38%;

Overall Acceptance Rate 54 of 142 submissions, 38%
