ISAR Image Quality Assessment Based on Visual Attention Model
<p>The architecture of the proposed model. The image is partitioned into 8 × 8-sized patches. Then, the linear projection layer performs a convolution operation on these patches to acquire patch embeddings, which can be processed by the Transformer encoder. The Gram–T model computes the Gramin matrix of extracted features to obtain score tokens through one attention layer. Moreover, the CAB and IRAB strength interactions between channels and regions within features are output by the Transformer encoder. Finally, the prediction score from the IRAB is added by a score token from Gram–T to obtain the final score.</p> "> Figure 2
<p>CAB module.</p> "> Figure 3
<p>IRAB module.</p> "> Figure 4
<p>Display of the results in the training process.</p> "> Figure 5
<p>Example images from ISAR image dataset. The first caption row below the images refers to the ground truth scores of the according images, and the second row refers to prediction scores.</p> "> Figure 6
<p>Attention heatmap analysis. The attention heatmap is on the left, and the original image is on the right. In our experiments, the blue color in the attention heatmap represents the high weight, and the red color represents low weight.</p> ">
Abstract
:1. Introduction
- (1)
- In this paper, Gram–T is designed for the quality assessment of ISAR images. The Gram–T model is highly adaptable and flexible when dealing with sequence data. By transforming an image into sequence data, the Transformer encoder can effectively grab spatial relationships and global information in the image. Furthermore, computing the Gramin matrix enhances the contrast of features representing different quality levels of ISAR images, thereby strengthening the association between ISAR image features and quality levels. In this way, a strong association between features and image quality can be explored, conducive to more discriminative features.
- (2)
- We employ the Channel Attention Block (CAB) and Inter-Region Attention Block (IRAB) to enhance interactions between channels and space within features. The spatial dimension-oriented attention module captures local scattering point features, while the channel-oriented attention module captures semantic association information on channels. The CAB and IRAB modules enhance the extraction of features that characterize target clarity and image quality.
- (3)
- Structure and texture constraints are proposed to boost model performance due to the exclusive properties of ISAR images. Extensive experiments are conducted on the ISAR dataset. Compared with various network architectures and the latest ISAR image evaluation methods, our method achieves the best performance. Ablation experiments demonstrate the effectiveness of each module.
2. Proposed Method
2.1. Overall Architecture
2.2. Gramin Transformer
2.3. Channel Attention Block
2.4. Inter-Region Attention Block
2.5. Loss Function
3. Experimental Setup and Analysis of Results
4. Discussion of the Method
4.1. Ablation Studies
4.2. Attention Heatmap Analysis
4.3. Potential Applications of Gram–T
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Lee, S.J.; Lee, M.J.; Kim, K.T.; Bae, J.H. Classification of ISAR Images Using Variable Cross-Range Resolutions. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 2291–2303. [Google Scholar] [CrossRef]
- Benedek, C.; Martorella, M. Moving Target Analysis in ISAR Image Sequences with a Multiframe Marked Point Process Model. IEEE Trans. Geosci. Remote Sens. 2014, 52, 2234–2246. [Google Scholar] [CrossRef]
- Bosse, S.; Maniry, D.; Müller, K.; Wiegand, T.; Samek, W. Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment. IEEE Trans. Image Process. 2018, 27, 206–219. [Google Scholar] [CrossRef]
- Ebrahimi, S.; Vladimir, Y.M. Image Quality Improvement in Kidney Stone Detection on Computed Tomography Images. J. Image Graph. 2015, 3, 40–46. [Google Scholar] [CrossRef]
- Chandler, D.M. Seven challenges in image quality assessment: Past present and future research. Int. Sch. Res. Not. 2013, 2013, 905685. [Google Scholar] [CrossRef]
- Ju, Y.W.; Zhang, Y. Research on ISAR image quality evaluation. Syst. Eng. Electron. 2015, 37, 297–303. [Google Scholar]
- Huang, L.; Wang, Y.; Jin, S. A Quantitative Evaluation Approach for ISAR Image Performance. Radar Sci. Technol. 2017, 15, 43–49. [Google Scholar]
- Li, J.; Tian, B.; Li, S.; Wang, Y.; He, T.; Xu, S. Inverse Synthetic Aperture Radar Image Quality Assessment Based on BP Neural Network. In Proceedings of the 2023 8th International Conference on Signal and Image Processing (ICSIP), Wuxi, China, 8–10 July 2023; pp. 414–418. [Google Scholar]
- Jasinski, T.; Rosenberg, L.; Antipov, I. Automated ISAR Image Quality Assessment. In Proceedings of the 2023 IEEE International Radar Conference (RADAR), Sydney, Australia, 6–10 November 2023; pp. 1–5. [Google Scholar] [CrossRef]
- Madhusudana, P.C.; Birkbeck, N.; Wang, Y.; Adsumilli, B.; Bovik, A.C. Image quality assessment using contrastive learning. IEEE Trans. Image Process. 2022, 31, 4149–4161. [Google Scholar] [CrossRef] [PubMed]
- Zhao, K.; Yuan, K.; Sun, M.; Li, M.; Wen, X. Quality-aware pre-trained models for blind image quality assessment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 22302–22313. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16 × 16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
- Cao, M.; Fan, Y.; Zhang, Y.; Wang, J.; Yang, Y. Vdtr: Video deblurring with transformer. arXiv 2022, arXiv:2204.08023. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021. [Google Scholar]
- Ke, J.; Wang, Q.; Wang, Y.; Milanfar, P.; Yang, F. Musiq: Multi-scale image quality transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 5148–5157. [Google Scholar]
- Tong, J.; Yang, Q.; Shen, L.; Li, B.; Chu, S. ISAR image evaluation method based on visual perception supervised learning. In Proceedings of the SPIE 12803, Fifth International Conference on Artificial Intelligence and Computer Science (AICS 2023), Wuhan, China, 26–28 July 2023. [Google Scholar]
- Zhang, J. ISAR Image Quality Grade Evaluation of Space Targets Based on Fusion Feature. In Proceedings of the World Conference on Intelligent and 3-D Technologies (WCI3DT 2022) Methods, Algorithms and Applications; Springer Nature: Singapore, 2023; pp. 53–63. [Google Scholar]
- Mittal, A.; Moorthy, A.K.; Bovik, A.C. No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 2012, 21, 4695–4708. [Google Scholar] [CrossRef]
- Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “completely blind” image quality analyzer. IEEE Signal Process. Lett. 2012, 20, 209–212. [Google Scholar] [CrossRef]
- Liu, S.; Deng, W. Very deep convolutional neural network based image classification using small training sample size. In Proceedings of the 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, Malaysia, 3–6 November 2015; pp. 730–734. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Methods | SRCC | PLCC | Score | Inference Time(s) |
---|---|---|---|---|
MQEM [7] | 0.6411 | 0.5652 | 1.206 | 0.30 |
BRISQUE [19] | 0.6501 | 0.5844 | 1.235 | 0.20 |
NIQE [20] | 0.5312 | 0.5382 | 1.069 | 0.10 |
VGG16 [21] | 0.4564 | 0.4148 | 0.871 | 0.15 |
Resnet18 | 0.7102 | 0.6507 | 1.361 | 0.24 |
Resnet34 | 0.8697 | 0.8296 | 1.699 | 0.36 |
Resnet50 [22] | 0.8712 | 0.8499 | 1.721 | 0.47 |
ViT [12] | 0.8731 | 0.8467 | 1.720 | 0.28 |
CNN-based Regression [9] | 0.5531 | 0.5437 | 1.097 | 0.43 |
KNN-based Regression [9] | 0.4626 | 0.4782 | 0.941 | 0.22 |
VPSL [17] | 0.6916 | 0.6372 | 1.329 | 0.34 |
EBFF [18] | 0.7941 | 0.7219 | 1.516 | 0.65 |
Ours | 0.9103 | 0.8627 | 1.773 | 0.31 |
Methods | SRCC | PLCC | Score |
---|---|---|---|
Gram–T | 0.8831 | 0.8477 | 1.730 |
Gram–T+CAB | 0.8874 | 0.8567 | 1.744 |
Gram–T+IRAB | 0.9072 | 0.8645 | 1.771 |
Gram–T+CAB+IRAB | 0.9103 | 0.8627 | 1.773 |
Scale Factor | SRCC | PLCC | Score |
---|---|---|---|
0 | 0.8874 | 0.8567 | 1.744 |
0.5 | 0.9071 | 0.8560 | 1.763 |
1 | 0.8972 | 0.8551 | 1.752 |
0.8 | 0.9103 | 0.8627 | 1.773 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, J.; Zhao, Z.; Tian, X. ISAR Image Quality Assessment Based on Visual Attention Model. Appl. Sci. 2025, 15, 1996. https://doi.org/10.3390/app15041996
Zhang J, Zhao Z, Tian X. ISAR Image Quality Assessment Based on Visual Attention Model. Applied Sciences. 2025; 15(4):1996. https://doi.org/10.3390/app15041996
Chicago/Turabian StyleZhang, Jun, Zhicheng Zhao, and Xilan Tian. 2025. "ISAR Image Quality Assessment Based on Visual Attention Model" Applied Sciences 15, no. 4: 1996. https://doi.org/10.3390/app15041996
APA StyleZhang, J., Zhao, Z., & Tian, X. (2025). ISAR Image Quality Assessment Based on Visual Attention Model. Applied Sciences, 15(4), 1996. https://doi.org/10.3390/app15041996