Computer Science > Computer Vision and Pattern Recognition

arXiv:2101.06658 (cs)

[Submitted on 17 Jan 2021 (v1), last revised 23 Apr 2021 (this version, v2)]

Title:Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Authors:Yan Wu, Zhiwu Huang, Suryansh Kumar, Rhea Sanjay Sukthanker, Radu Timofte, Luc Van Gool

View PDF

Abstract:Modern solutions to the single image super-resolution (SISR) problem using deep neural networks aim not only at better performance accuracy but also at a lighter and computationally efficient model. To that end, recently, neural architecture search (NAS) approaches have shown some tremendous potential. Following the same underlying, in this paper, we suggest a novel trilevel NAS method that provides a better balance between different efficiency metrics and performance to solve SISR. Unlike available NAS, our search is more complete, and therefore it leads to an efficient, optimized, and compressed architecture. We innovatively introduce a trilevel search space modeling, i.e., hierarchical modeling on network-, cell-, and kernel-level structures. To make the search on trilevel spaces differentiable and efficient, we exploit a new sparsestmax technique that is excellent at generating sparse distributions of individual neural architecture candidates so that they can be better disentangled for the final selection from the enlarged search space. We further introduce the sorting technique to the sparsestmax relaxation for better network-level compression. The proposed NAS optimization additionally facilitates simultaneous search and training in a single phase, reducing search time and train time. Comprehensive evaluations on the benchmark datasets show our method's clear superiority over the state-of-the-art NAS in terms of a good trade-off between model size, performance, and efficiency.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2101.06658 [cs.CV]
	(or arXiv:2101.06658v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2101.06658

Submission history

From: Zhiwu Huang [view email]
[v1] Sun, 17 Jan 2021 12:19:49 UTC (7,835 KB)
[v2] Fri, 23 Apr 2021 15:50:09 UTC (2,488 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators