Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3579654.3579734acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacaiConference Proceedingsconference-collections
research-article

DTTrack: Target Tracking Algorithm Combining DaSiamRPN Tracker and Transformer Tracker

Published: 14 March 2023 Publication History

Abstract

At present, transformer-based target tracking algorithms mainly use transformers to fuse deep convolution features, their tracking accuracy is competitive, however compared with convolutional neural networks, their tracking speed is slow. Due to the long-distance dependence characteristics, it is difficult to obtain rich local information when extracting visual features, the tracking results may become worse, or even the tracking may fail in the later tracking procedures. The partial target tracking algorithm based on the Siamese network has great advantages in extracting local information, however its tracking accuracy cannot fully reach the transformer-based target tracking algorithm. According to the characteristics of the two trackers, combining the response scores and Hamming distance which is used to calculate the similarity, then a target tracking algorithm combining DaSiamRPN and Transformer is proposed. This structure can judge whether the tracking effect of the transformer tracker has deteriorated according to the response score and the Hamming distance between the resulting frame and the initial frame during transformer tracking, in order to replace another tracker in time. The proposed method can reduce the drift and obtain higher accuracy as well. Experiments show that our tracker achieves good results on three datasets. Our method achieved 72.0%, 69.1%, and 67.1% success rates on the GOT-10k, OTB2015, and UAV123 datasets, respectively.

References

[1]
Diana B Aglyamutdinova, Rail R Mazgutov, and Boris V Vishnyakov. 2018. Object localization for subsequent UAV tracking. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci 42, 2 (2018).
[2]
Luca Bertinetto, Jack Valmadre, Joao F Henriques, Andrea Vedaldi, and Philip HS Torr. 2016. Fully-convolutional siamese networks for object tracking. In European conference on computer vision. Springer, 850–865.
[3]
Zhaowei Cai and Nuno Vasconcelos. 2018. Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 6154–6162.
[4]
Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In European conference on computer vision. Springer, 213–229.
[5]
Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, and Huchuan Lu. 2021. Transformer tracking. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8126–8135.
[6]
Yutao Cui, Cheng Jiang, Limin Wang, and Gangshan Wu. 2022. MixFormer: End-to-End Tracking With Iterative Mixed Attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 13608–13618.
[7]
Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, and Michael Felsberg. 2019. Atom: Accurate tracking by overlap maximization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4660–4669.
[8]
Martin Danelljan, Luc Van Gool, and Radu Timofte. 2020. Probabilistic regression for visual tracking. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 7183–7192.
[9]
Ross Girshick. 2015. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision. 1440–1448.
[10]
Kai Han, An Xiao, Enhua Wu, Jianyuan Guo, Chunjing Xu, and Yunhe Wang. 2021. Transformer in transformer. Advances in Neural Information Processing Systems 34 (2021), 15908–15919.
[11]
Lianghua Huang, Xin Zhao, and Kaiqi Huang. 2019. Got-10k: A large high-diversity benchmark for generic object tracking in the wild. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 5(2019), 1562–1577.
[12]
Bo Li, Wei Wu, Qiang Wang, Fangyi Zhang, Junliang Xing, and Junjie Yan. 2019. Siamrpn++: Evolution of siamese visual tracking with very deep networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4282–4291.
[13]
Bo Li, Junjie Yan, Wei Wu, Zheng Zhu, and Xiaolin Hu. 2018. High performance visual tracking with siamese region proposal network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 8971–8980.
[14]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems 28 (2015).
[15]
Ran Tao, Efstratios Gavves, and Arnold WM Smeulders. 2016. Siamese instance search for tracking. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1420–1429.
[16]
Paul Voigtlaender, Jonathon Luiten, Philip HS Torr, and Bastian Leibe. 2020. Siam r-cnn: Visual tracking by re-detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 6578–6588.
[17]
Thang Vu, Hyunjun Jang, Trung X Pham, and Chang Yoo. 2019. Cascade rpn: Delving into high-quality region proposal network with adaptive convolution. Advances in neural information processing systems 32 (2019).
[18]
Guangting Wang, Chong Luo, Zhiwei Xiong, and Wenjun Zeng. 2019. Spm-tracker: Series-parallel matching for real-time visual object tracking. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3643–3652.
[19]
Ning Wang, Wengang Zhou, Jie Wang, and Houqiang Li. 2021. Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1571–1580.
[20]
Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, and Huchuan Lu. 2021. Learning spatio-temporal transformer for visual tracking. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10448–10457.
[21]
Song Yan, Jinyu Yang, Jani Käpylä, Feng Zheng, Aleš Leonardis, and Joni-Kristian Kämäräinen. 2021. Depthtrack: Unveiling the power of rgbd tracking. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10725–10733.
[22]
Zhipeng Zhang, Houwen Peng, Jianlong Fu, Bing Li, and Weiming Hu. 2020. Ocean: Object-aware anchor-free tracking. In European Conference on Computer Vision. Springer, 771–787.
[23]
Shaochuan Zhao, Tianyang Xu, Xiao-Jun Wu, and Xue-Feng Zhu. 2021. Adaptive feature fusion for visual object tracking. Pattern Recognition 111(2021), 107679.
[24]
Zheng Zhu, Qiang Wang, Bo Li, Wei Wu, Junjie Yan, and Weiming Hu. 2018. Distractor-aware siamese networks for visual object tracking. In Proceedings of the European conference on computer vision (ECCV). 101–117.

Cited By

View all
  • (2024)CT-DETR and ReID-Guided Multi-Target Tracking Algorithm in Complex ScenesIECE Transactions on Emerging Topics in Artificial Intelligence10.62762/TETAI.2024.2405291:1(44-57)Online publication date: 29-May-2024

Index Terms

  1. DTTrack: Target Tracking Algorithm Combining DaSiamRPN Tracker and Transformer Tracker

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ACAI '22: Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence
    December 2022
    770 pages
    ISBN:9781450398336
    DOI:10.1145/3579654
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 March 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. DaSiamRPN
    2. Hamming distance
    3. Similarity
    4. Tracker
    5. Transt

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ACAI 2022

    Acceptance Rates

    Overall Acceptance Rate 173 of 395 submissions, 44%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 30 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)CT-DETR and ReID-Guided Multi-Target Tracking Algorithm in Complex ScenesIECE Transactions on Emerging Topics in Artificial Intelligence10.62762/TETAI.2024.2405291:1(44-57)Online publication date: 29-May-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media