Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.03060 (cs)

[Submitted on 6 Aug 2023]

Title:TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment

Authors:Chaofeng Chen, Jiadi Mo, Jingwen Hou, Haoning Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin

View PDF

Abstract:Image Quality Assessment (IQA) is a fundamental task in computer vision that has witnessed remarkable progress with deep neural networks. Inspired by the characteristics of the human visual system, existing methods typically use a combination of global and local representations (\ie, multi-scale features) to achieve superior performance. However, most of them adopt simple linear fusion of multi-scale features, and neglect their possibly complex relationship and interaction. In contrast, humans typically first form a global impression to locate important regions and then focus on local details in those regions. We therefore propose a top-down approach that uses high-level semantics to guide the IQA network to focus on semantically important local distortion regions, named as \emph{TOPIQ}. Our approach to IQA involves the design of a heuristic coarse-to-fine network (CFANet) that leverages multi-scale features and progressively propagates multi-level semantic information to low-level representations in a top-down manner. A key component of our approach is the proposed cross-scale attention mechanism, which calculates attention maps for lower level features guided by higher level features. This mechanism emphasizes active semantic regions for low-level distortions, thereby improving performance. CFANet can be used for both Full-Reference (FR) and No-Reference (NR) IQA. We use ResNet50 as its backbone and demonstrate that CFANet achieves better or competitive performance on most public FR and NR benchmarks compared with state-of-the-art methods based on vision transformers, while being much more efficient (with only ${\sim}13\%$ FLOPS of the current best FR method). Codes are released at \url{this https URL}.

Comments:	13 pages, 8 figures, 10 tables. In submission
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.03060 [cs.CV]
	(or arXiv:2308.03060v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.03060

Submission history

From: Chaofeng Chen [view email]
[v1] Sun, 6 Aug 2023 09:08:37 UTC (23,171 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators