Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3653781.3653818acmotherconferencesArticle/Chapter ViewAbstractPublication PagescvdlConference Proceedingsconference-collections
research-article

Multi-scale Semantic Information Refinement and Hybrid Dual Attention Module based Object Detection in Remote Sensing Images

Published: 01 June 2024 Publication History

Abstract

The high-resolution remote sensing images are of high research value in a lot of areas of life as remote sensing technology continues to develop. However, when compared to ordinary images, the more complicated scenes of remote sensing images, characterized by large scale variations of their target objects and complex backgrounds, which lead to the detection of these types of images still confront numerous challenges. Therefore, in response to the above challenges, this paper proposes a framework for object detection in remote sensing images with multi-scale semantic information refinement (MSSIR) and hybrid dual attention module (HDA). In the first place, MSSIR is designed in the feature pyramid section. This module is a multi-scale module with multi-level jump connections and it refines the semantic information of each feature layer in both top-down and bottom-up ways, while using jump connections to increase the connection between the deeper and shallower networks as a way to improve the extraction of feature information from each layer of the network. Meanwhile, transposed convolution is applied in the upsampling process of the feature pyramid, which allows the upsampling process to learn to achieve optimal results through the network. Secondly, the HDA module, which combines the dual benefits of channel attention and spatial attention, is designed to guide the network to concentrate more on feature information. For the last, the addition of the high level in the multi-scale module allows the network to learn further to higher level semantic information. The experimental results on three open datasets demonstrate that the proposed algorithm has remarkably improved the average accuracy with respect to the other algorithms.

References

[1]
Y. Wu, K. Zhang, J. Wang, Y. Wang, Q. Wang and X. Li, GCWNet: A Global Context-Weaving Network for Object Detection in Remote Sensing Images. in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-12, 2022, Art no. 5619912.
[2]
R Girshick, J Donahue, T Darrell, J Malik, Rich feature hierarchies for accurate object detection and semantic segmentation.Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014:580-587.
[3]
S Ren,K He, R Girshick, J Sun,Faster R-CNN: towards real-time object detection with region proposal networks.IEEE Transactions on Pattern Analysis & Machine Intelligence.2017,39(6):1137-1149.
[4]
W Liu, D Anguelov, D Erhan,et al,SSD: single shot MultiBox detector.Computer vision-ECCV 2016.Lecture notes in computer science.2016,9905:21-37.
[5]
Z Tian,C Shen,H Chen,T He,FCOS: fully convol-utional one-stage object detection. ICCV.2019:9626-9635. arXiv.1904.01355.
[6]
Y Dong, C Yu, P Wang, Z Feng,Airplane detection of optical remote sensing images based on deep learning. Laser & Optoelectronics Progress.2020,57 (04): 102-108.
[7]
Y Wang, X Wang,Remote sensing image target detection model based on attention and feature fusion. Laser & Optoelectronics Progress.2021,58 (02): 363-371.
[8]
S Wei, X Zeng, Q Qu,M Wang, J Shi,HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation. IEEE Access. 2020, 8:120234-120254.
[9]
G Cheng, J Han, P Zhou, G Lei, Multi-class geospatial object detection and geographic image classification based on collection of part detectors.ISPRS Journal of Photogrammetry and Remote Sensing. 2014, 98: 119-132.
[10]
G Cheng, P Zhou, J Han, Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images.IEEE Transactions on Geoscience and Remote Sensing.2016,54(12):7405-7415.
[11]
A Krizhevsky, I Sutskever, G Hinton,Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems. 2012:1–9.
[12]
R Dong, D Xu, J Zhao, L Jiao, J An. Sig-NMS-Based Faster R-CNN Combining Transfer Learning for Small Target Detection in VHR Optical Remote Sensing Imagery. IEEE Trans. Geosci. Remote Sens. 2019,57: 8534–8545.
[13]
H Qiu, H Li, Q Wu, F Meng, H Shi, A2RMNet: Adaptively Aspect Ratio Multi-Scale Network for Object Detection in Remote Sensing Images. Remote Sens. 2019, 11(13):1594.
[14]
K Zhou, Z Zhang, C Gao, J Liu, Rotated Feature Network for Multiorientation Object Detection of Remote-Sensing Images. IEEE Geoscience Remote Sensing Letters. 2020,18:33-37.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
CVDL '24: Proceedings of the International Conference on Computer Vision and Deep Learning
January 2024
506 pages
ISBN:9798400718199
DOI:10.1145/3653804
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 2024

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CVDL 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 12
    Total Downloads
  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media