Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3653863.3653873acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssipConference Proceedingsconference-collections
research-article
Open access

Enhanced Object Detection in Highly Compressed Images using Regions of Interest

Published: 07 June 2024 Publication History

Abstract

With the increasing popularity of digital media, the need to store and transmit large amounts of visual data has also increased. Image compression techniques can reduce file sizes and bandwidth requirements while maintaining an acceptable level of image quality. Classical techniques are designed to optimize images or videos for the Human Visual System (HVS), which takes into account the characteristics of the human eye and brain in perceiving and processing visual information. However, newer compression techniques, aiming not only at HVS optimization but also at improving performance when considering tasks driven by machines, are being developed. In this context, this paper proposes an efficient approach to enhance the performance of object detection Neural Networks (NNs) in highly compressed images, using Regions of Interest. We evaluate the mean Average Precision (mAP) in both Faster Region-based Convolutional Neural Network (R-CNN) and DyHead Neural Network (NN), considering two different application scenarios: generic object detection and industrial supervision. In comparison with the High Efficiency Video Coding (HEVC) standard, the proposed approach allows us to reduce the bitstream up to 40% while achieving a similar accuracy in the object detection task, regardless of the considered network. These results demonstrate that high compression ratios can be achieved while maintaining good image quality for machine visual perception in task-driven systems.

References

[1]
K. Fischer, C. Herglotz, and A. Kaup. 2020. On Intra Video Coding and In-Loop Filtering for Neural Object Detection Networks. 2020 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates. pp. 1147-1151.
[2]
L.D. Chamain, F. Racape, J. Begaint, A. Purshparaja, and S. Feltman. 2021. End-to-end optimized image compression for multiple machine tasks. Data Compression Conference (DCC). pp. 163-172.
[3]
C. Hyomin, and I.V. Bajic. 2018. High efficiency compression for object detection. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 1792-1796.
[4]
L. Galteri, M. Bertini, L. Seidenari and A. Del Bimbo. 2018. Video Compression for Object Detection Algorithms. 24th International Conference on Pattern Recognition (ICPR), Beijing, China. pp. 3007-3012.
[5]
Joint Collaborative Team on Video Coding (JCT-VC) HEVC Test Model (HM) Software, https://vcgit.hhi.fraunhofer.de/jvet/HM, last accessed 2023/3/9.
[6]
Tsung-Yi Lin, M. Maire, S. Belongie, L. Bourdev, R. Girshick, J. Hays, P. Perona, D. Ramanan, C.L. Zitnick, and P. Dollar. 2014. Microsoft COCO: Common Objects in Context. Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland. pp. 740-755.
[7]
S. Ren, K. He, R.B. Girshick, and J. Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 39 pp. 1137-1149.
[8]
X. Dai, Y. Chen, B. Xiao, D. Chen, M. Liu, L. Yuan, and L. Zhang. 2021. Dynamic Head: Unifying Object Detection Heads with Attentions. Proceedings Of The IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 7373-7382.
[9]
MetaAI: Object Detection on COCO test-dev, https://paperswithcode.com/sota/object-detection-on-coco, last accessed 2023/3/6.
[10]
Y. Wu, A. Kirillov, F. Massa, Wan-Yen Lo and R. Girshick: Detectron2, https://github.com/facebookresearch/detectron2, last accessed 2023/3/6.

Index Terms

  1. Enhanced Object Detection in Highly Compressed Images using Regions of Interest

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SSIP '23: Proceedings of the 2023 6th International Conference on Sensors, Signal and Image Processing
    October 2023
    69 pages
    ISBN:9798400707995
    DOI:10.1145/3653863
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 June 2024

    Check for updates

    Author Tags

    1. Highly Compressed Images
    2. Object Detection
    3. Regions of Interest
    4. Surveillance and Industrial Supervision

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SSIP 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 68
      Total Downloads
    • Downloads (Last 12 months)68
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media