research-article

A Object Detection Network for Remote Sensing Images Based on Global Multiscale Attention and Local Context Dynamic Interaction

Authors:

Weijie WengAuthors Info & Claims

CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering

Pages 325 - 331

https://doi.org/10.1145/3569966.3570061

Published: 20 December 2022 Publication History

Abstract

Abstract: Remote sensing image (RSI) object detection has been applied in the fields of marine information mining, building change detection, and ship monitoring. However, the application of advanced general-purpose object detection of RSI usually has significant degradation of performance. This is mainly due to the special natures of RSI such as complex background interference, arbitrarily oriented objects, small objects, and multi-scale. For these issues, we propose a novel object detection network (GLC-Net) for RSI based on global multiscale attention and local context dynamic interaction. Specifically, a global multiscale attention module (GMA) is designed to mitigate background interference. By incorporating the attention mechanism in the multiscale feature map, the model can focus on remote sensing object features and improve the efficiency and accuracy of detection tasks. To deal with the problem of the arbitrary orientation of objects, the rotated region proposal network is utilized with reference to the centroid offset representation. For the lack of information on small objects and the cross-scale problem, Contextual Dynamic Interaction Detection Head (CDI Head) is designed for feature filtering and enhancement by extracting local contextual information and ROI features in a one-to-one dynamic interaction. Extensive experiments had been conducted on two public benchmark datasets. The experiments demonstrate that the proposed GLC-Net has superior performance in RSI object detection.

References

[1]

A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in neural information processing systems, vol. 25, 2012.

[2]

G.-S. Xia, "DOTA: A large-scale dataset for object detection in aerial images," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 3974-3983.

[3]

Z. Liu, H. Wang, L. Weng, and Y. Yang, "Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds," IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 8, pp. 1074-1078, 2016.

[4]

R. Girshick, "Fast r-cnn," in Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440-1448.

Digital Library

[5]

S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," Advances in neural information processing systems, vol. 28, 2015.

[6]

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 779-788.

[7]

T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal loss for dense object detection," in Proceedings of the IEEE international conference on computer vision, 2017, pp. 2980-2988.

[8]

W. Liu, L. Ma, and H. Chen, "Arbitrary-oriented ship detection framework in optical remote-sensing images," IEEE Geoscience and Remote Sensing Letters, vol. 15, no. 6, pp. 937-941, 2018.

[9]

J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 7263-7271.

[10]

Z. Zhang, W. Guo, S. Zhu, and W. Yu, "Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks," IEEE Geoscience and Remote Sensing Letters, vol. 15, no. 11, pp. 1745-1749, 2018.

[11]

J. Ding, N. Xue, Y. Long, G.-S. Xia, and Q. Lu, "Learning RoI transformer for oriented object detection in aerial images," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 2849-2858.

[12]

Y. Xu, "Gliding vertex on the horizontal bounding box for multi-oriented object detection," IEEE transactions on pattern analysis and machine intelligence, vol. 43, no. 4, pp. 1452-1459, 2020.

[13]

X. Xie, G. Cheng, J. Wang, X. Yao, and J. Han, "Oriented R-CNN for object detection," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 3520-3529.

[14]

T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2117-2125.

[15]

K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770-778.

[16]

J. Hu, L. Shen, and G. Sun, "Squeeze-and-excitation networks," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7132-7141.

[17]

G. Zhang, S. Lu, and W. Zhang, "CAD-Net: A context-aware detection network for objects in remote sensing imagery," IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 12, pp. 10015-10024, 2019.

[18]

X. Yang, J. Yan, Z. Feng, and T. He, "R3det: Refined single-stage detector with feature refinement for rotating object," in Proceedings of the AAAI conference on artificial intelligence, 2021, vol. 35, no. 4, pp. 3163-3171.

[19]

J. Han, J. Ding, J. Li, and G.-S. Xia, "Align deep features for oriented object detection," IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-11, 2021.

[20]

X. Yang, "Scrdet: Towards more robust detection for small, cluttered and rotated objects," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 8232-8241.

[21]

J. Yi, P. Wu, B. Liu, Q. Huang, H. Qu, and D. Metaxas, "Oriented object detection in aerial images with box boundary-aware vectors," in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2021, pp. 2150-2159.

[22]

X. Zhang, G. Wang, P. Zhu, T. Zhang, C. Li, and L. Jiao, "GRS-Det: An anchor-free rotation ship detector based on Gaussian-mask in remote sensing images," IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 4, pp. 3518-3531, 2020.

[23]

Y. Jiang, "R2CNN: Rotational region CNN for orientation robust scene text detection," arXiv preprint arXiv:1706.09579, 2017.

Index Terms

A Object Detection Network for Remote Sensing Images Based on Global Multiscale Attention and Local Context Dynamic Interaction
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object detection

Recommendations

A Similarity Constraint Divergent Activation Method for Weakly Supervised Object Detection in Remote Sensing Images
Image and Graphics
Abstract
With the development of remote sensing technology and object detection technology, many fully-supervised convolutional neural networks (CNN) methods based on object labeling information such as bounding box have achieved good results in remote ...
Inception Parallel Attention Network for Small Object Detection in Remote Sensing Images
Pattern Recognition and Computer Vision
Abstract
Small object detection in remote sensing images is a major challenge in the field of computer vision. Most previous methods detect small objects using a multiscale feature fusion approach with the same weights. However, experiments shows that the ...
Performance analysis of different DCNN models in remote sensing image object detection
Abstract
In recent years, deep learning, especially deep convolutional neural networks (DCNN), has made great progress. Many researchers use different DCNN models to detect remote sensing targets. Different DCNN models have different advantages and ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering

October 2022

753 pages

ISBN:9781450397780

DOI:10.1145/3569966

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 December 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

the Natural Science Foundation of Fujian Province of China
the High-level Talent Project of Xiamen University of Technology
the Foundation of Educational and Scientific Research Projects for Young and Middle-aged Teachers of Fujian Province

Conference

CSSE 2022

CSSE 2022: 2022 5th International Conference on Computer Science and Software Engineering

October 21 - 23, 2022

Guilin, China

Acceptance Rates

Overall Acceptance Rate 33 of 74 submissions, 45%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
20
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)1

Reflects downloads up to 16 Nov 2024

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents