Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2204.04216 (eess)

[Submitted on 8 Apr 2022 (v1), last revised 20 Apr 2022 (this version, v3)]

Title:Learning Trajectory-Aware Transformer for Video Super-Resolution

Authors:Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian

View PDF

Abstract:Video super-resolution (VSR) aims to restore a sequence of high-resolution (HR) frames from their low-resolution (LR) counterparts. Although some progress has been made, there are grand challenges to effectively utilize temporal dependency in entire video sequences. Existing approaches usually align and aggregate video frames from limited adjacent frames (e.g., 5 or 7 frames), which prevents these approaches from satisfactory results. In this paper, we take one step further to enable effective spatio-temporal learning in videos. We propose a novel Trajectory-aware Transformer for Video Super-Resolution (TTVSR). In particular, we formulate video frames into several pre-aligned trajectories which consist of continuous visual tokens. For a query token, self-attention is only learned on relevant visual tokens along spatio-temporal trajectories. Compared with vanilla vision Transformers, such a design significantly reduces the computational cost and enables Transformers to model long-range features. We further propose a cross-scale feature tokenization module to overcome scale-changing problems that often occur in long-range videos. Experimental results demonstrate the superiority of the proposed TTVSR over state-of-the-art models, by extensive quantitative and qualitative evaluations in four widely-used video super-resolution benchmarks. Both code and pre-trained models can be downloaded at this https URL.

Comments:	CVPR 2022 Oral
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.04216 [eess.IV]
	(or arXiv:2204.04216v3 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2204.04216

Submission history

From: Chengxu Liu [view email]
[v1] Fri, 8 Apr 2022 03:37:39 UTC (5,972 KB)
[v2] Sat, 16 Apr 2022 13:17:30 UTC (5,979 KB)
[v3] Wed, 20 Apr 2022 04:38:21 UTC (5,979 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Learning Trajectory-Aware Transformer for Video Super-Resolution

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Learning Trajectory-Aware Transformer for Video Super-Resolution

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators