Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3569966.3570061acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsseConference Proceedingsconference-collections
research-article

A Object Detection Network for Remote Sensing Images Based on Global Multiscale Attention and Local Context Dynamic Interaction

Published: 20 December 2022 Publication History

Abstract

Abstract: Remote sensing image (RSI) object detection has been applied in the fields of marine information mining, building change detection, and ship monitoring. However, the application of advanced general-purpose object detection of RSI usually has significant degradation of performance. This is mainly due to the special natures of RSI such as complex background interference, arbitrarily oriented objects, small objects, and multi-scale. For these issues, we propose a novel object detection network (GLC-Net) for RSI based on global multiscale attention and local context dynamic interaction. Specifically, a global multiscale attention module (GMA) is designed to mitigate background interference. By incorporating the attention mechanism in the multiscale feature map, the model can focus on remote sensing object features and improve the efficiency and accuracy of detection tasks. To deal with the problem of the arbitrary orientation of objects, the rotated region proposal network is utilized with reference to the centroid offset representation. For the lack of information on small objects and the cross-scale problem, Contextual Dynamic Interaction Detection Head (CDI Head) is designed for feature filtering and enhancement by extracting local contextual information and ROI features in a one-to-one dynamic interaction. Extensive experiments had been conducted on two public benchmark datasets. The experiments demonstrate that the proposed GLC-Net has superior performance in RSI object detection.

References

[1]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in neural information processing systems, vol. 25, 2012.
[2]
G.-S. Xia, "DOTA: A large-scale dataset for object detection in aerial images," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 3974-3983.
[3]
Z. Liu, H. Wang, L. Weng, and Y. Yang, "Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds," IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 8, pp. 1074-1078, 2016.
[4]
R. Girshick, "Fast r-cnn," in Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440-1448.
[5]
S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," Advances in neural information processing systems, vol. 28, 2015.
[6]
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 779-788.
[7]
T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal loss for dense object detection," in Proceedings of the IEEE international conference on computer vision, 2017, pp. 2980-2988.
[8]
W. Liu, L. Ma, and H. Chen, "Arbitrary-oriented ship detection framework in optical remote-sensing images," IEEE Geoscience and Remote Sensing Letters, vol. 15, no. 6, pp. 937-941, 2018.
[9]
J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 7263-7271.
[10]
Z. Zhang, W. Guo, S. Zhu, and W. Yu, "Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks," IEEE Geoscience and Remote Sensing Letters, vol. 15, no. 11, pp. 1745-1749, 2018.
[11]
J. Ding, N. Xue, Y. Long, G.-S. Xia, and Q. Lu, "Learning RoI transformer for oriented object detection in aerial images," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 2849-2858.
[12]
Y. Xu, "Gliding vertex on the horizontal bounding box for multi-oriented object detection," IEEE transactions on pattern analysis and machine intelligence, vol. 43, no. 4, pp. 1452-1459, 2020.
[13]
X. Xie, G. Cheng, J. Wang, X. Yao, and J. Han, "Oriented R-CNN for object detection," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 3520-3529.
[14]
T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2117-2125.
[15]
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770-778.
[16]
J. Hu, L. Shen, and G. Sun, "Squeeze-and-excitation networks," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7132-7141.
[17]
G. Zhang, S. Lu, and W. Zhang, "CAD-Net: A context-aware detection network for objects in remote sensing imagery," IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 12, pp. 10015-10024, 2019.
[18]
X. Yang, J. Yan, Z. Feng, and T. He, "R3det: Refined single-stage detector with feature refinement for rotating object," in Proceedings of the AAAI conference on artificial intelligence, 2021, vol. 35, no. 4, pp. 3163-3171.
[19]
J. Han, J. Ding, J. Li, and G.-S. Xia, "Align deep features for oriented object detection," IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-11, 2021.
[20]
X. Yang, "Scrdet: Towards more robust detection for small, cluttered and rotated objects," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 8232-8241.
[21]
J. Yi, P. Wu, B. Liu, Q. Huang, H. Qu, and D. Metaxas, "Oriented object detection in aerial images with box boundary-aware vectors," in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2021, pp. 2150-2159.
[22]
X. Zhang, G. Wang, P. Zhu, T. Zhang, C. Li, and L. Jiao, "GRS-Det: An anchor-free rotation ship detector based on Gaussian-mask in remote sensing images," IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 4, pp. 3518-3531, 2020.
[23]
Y. Jiang, "R2CNN: Rotational region CNN for orientation robust scene text detection," arXiv preprint arXiv:1706.09579, 2017.

Index Terms

  1. A Object Detection Network for Remote Sensing Images Based on Global Multiscale Attention and Local Context Dynamic Interaction

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering
    October 2022
    753 pages
    ISBN:9781450397780
    DOI:10.1145/3569966
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 December 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Contextual Dynamic Interaction
    2. Remote sensing image
    3. attention mechanism
    4. object detection

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • the Natural Science Foundation of Fujian Province of China
    • the High-level Talent Project of Xiamen University of Technology
    • the Foundation of Educational and Scientific Research Projects for Young and Middle-aged Teachers of Fujian Province

    Conference

    CSSE 2022

    Acceptance Rates

    Overall Acceptance Rate 33 of 74 submissions, 45%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 20
      Total Downloads
    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media