Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3654823.3654840acmotherconferencesArticle/Chapter ViewAbstractPublication PagescacmlConference Proceedingsconference-collections
research-article

Pedestrian Detection Algorithm of YOLOV8 Based on Feature Enhancement

Published: 29 May 2024 Publication History

Abstract

In an effort to overcome the problem of insufficient network ability of YOLOV8 backbone feature extraction and poor feature fusion ability, a C2f module based on attention mechanism is introduced to carry to a new and higher level the model’s ability to express input features. Aiming at the feature fusion in improved YOLOV8, this paper first modifies the structure of FPN in order to effectively integrate the feature information extracted by backbone, so as to enhance the feature fusion ability of the model. In addition, this paper designs a new feature fusion structure to replace the existing PAN structure, make the feature map information input by FPN more reasonably utilized, so as to improve the overall detection accuracy of the model. The experimental results show that the improved YOLOV8 algorithm performs well on the CUHK dataset [email protected] Improved by 1.7%, and the detection speed remained basically unchanged, with excellent performance on WiderPerson, [email protected] Reaching 91.3%, it can quickly and accurately achieve pedestrian detection. In addition, this paper conducts relevant ablation experiments and comparative experiments to validate the effectiveness of each proposed improvement.

References

[1]
S Devi, Kowshik Thopalli, P Malarvezhi, and Jayaraman J. Thiagarajan. 2022. Improving Single-Stage Object Detectors for Nighttime Pedestrian Detection. International Journal of Pattern Recognition and Artificial Intelligence 36, 09 (2022), 2250034.
[2]
Ross Girshick. 2015. Fast R-CNN. In Proc. of the IEEE International Conference on Computer Vision. 1440–1448.
[3]
Hui Liu, Li Peng, and Jiwei Wen. 2018. Multi-Scale Aware Pedestrian Detection Algorithm Based on Improved Full Convolutional Network. Laser & Optoelectronics Progress 55, 9 (2018), 318–324.
[4]
Luming Gong, Meihua Xu, DongJun Liu, and Fayu Zhang. 2018. Novel Model of Pedestrian Detection Based on Gaussian Mixture Model and HOG+SVM. Journal of Shanghai University (Natural Science) 24, 3 (2018), 341–351.
[5]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition. 779–788.
[6]
Joseph Redmon and Ali Farhadi. 2017. YOLO9000: Better, Faster, Stronger. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition. 6517–6525.
[7]
Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition. 580–587.
[8]
XianXian Tian, Hong Bao, and Cheng Xu. 2014. Improved HOG Algorithm of Pedestrian Detection. Computer Science 41, 9 (2014), 320–324.
[9]
Danfeng Wang, Chaobo Chen, TianLi Ma, Changhong Li, and Chunyu Miao. 2020. YOLOV3 Pedestrian Detection Algorithmbased on Depth-Wish Separable Convolution. Computer Applications and Software 37, 6 (2020), 218–223.
[10]
Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, and Nicu Sebe. 2017. Learning Cross-Modal Deep Representations for Robust Pedestrian Detection. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition. 4236–4244.
[11]
Meng Yang, Bao Zhang, and Yulong Song. 2018. Application of Support Vector Machine Based on Optimized Kernel Function in People Detection. Laser & Optoelectronics Progress 55, 4 (2018), 041001.

Index Terms

  1. Pedestrian Detection Algorithm of YOLOV8 Based on Feature Enhancement

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CACML '24: Proceedings of the 2024 3rd Asia Conference on Algorithms, Computing and Machine Learning
    March 2024
    478 pages
    ISBN:9798400716416
    DOI:10.1145/3654823
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 May 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CUHK
    2. YOLOV8
    3. attention mechanism
    4. feature fusion capability
    5. object detection
    6. pedestrian detection

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CACML 2024

    Acceptance Rates

    Overall Acceptance Rate 93 of 241 submissions, 39%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 17
      Total Downloads
    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media