Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.11840 (cs)

[Submitted on 24 Apr 2023]

Title:Robust and Efficient Memory Network for Video Object Segmentation

Authors:Yadang Chen, Dingwei Zhang, Zhi-xin Yang, Enhua Wu

Abstract:This paper proposes a Robust and Efficient Memory Network, referred to as REMN, for semi-supervised video object segmentation (VOS). Memory-based methods have recently achieved outstanding VOS performance by performing non-local pixel-wise matching between the query and memory. However, these methods have two limitations. 1) Non-local matching could cause distractor objects in the background to be incorrectly segmented. 2) Memory features with high temporal redundancy consume significant computing resources. For limitation 1, we introduce a local attention mechanism that tackles the background distraction by enhancing the features of foreground objects with the previous mask. For limitation 2, we first adaptively decide whether to update the memory features depending on the variation of foreground objects to reduce temporal redundancy. Second, we employ a dynamic memory bank, which uses a lightweight and differentiable soft modulation gate to decide how many memory features need to be removed in the temporal dimension. Experiments demonstrate that our REMN achieves state-of-the-art results on DAVIS 2017, with a $\mathcal{J}\&\mathcal{F}$ score of 86.3%, and on YouTube-VOS 2018, with an overall $\mathcal{G}$ score of 85.5%. Furthermore, our network shows a high inference speed of 25+ FPS and uses relatively few computing resources.
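
The adaptive memory update described in the abstract (storing a frame only when the foreground object has changed enough) can be illustrated with a minimal sketch. The abstract does not state how the variation is measured, so everything specific here is an assumption made for illustration: variation is taken as 1 - IoU between binary masks, the threshold value 0.3 is arbitrary, and the names `foreground_variation` and `select_memory_frames` are hypothetical rather than taken from the REMN code.

```python
from typing import List, Sequence

# Binary foreground mask as rows of 0/1 values (1 = object pixel).
Mask = Sequence[Sequence[int]]


def foreground_variation(prev_mask: Mask, curr_mask: Mask) -> float:
    """Return 1 - IoU of two binary masks (0.0 = identical, 1.0 = disjoint).

    Assumption: IoU is used as the change measure; the paper may use another.
    """
    inter = 0
    union = 0
    for prev_row, curr_row in zip(prev_mask, curr_mask):
        for p, c in zip(prev_row, curr_row):
            inter += 1 if (p and c) else 0
            union += 1 if (p or c) else 0
    if union == 0:
        return 0.0  # no foreground in either mask: treat as unchanged
    return 1.0 - inter / union


def select_memory_frames(masks: List[Mask], threshold: float = 0.3) -> List[int]:
    """Return the indices of frames that would be written to memory.

    A frame is stored only when its mask differs from the mask of the most
    recently stored frame by more than `threshold` (an assumed value).
    """
    if not masks:
        return []
    stored = [0]  # the first (annotated) frame is always kept
    for i in range(1, len(masks)):
        if foreground_variation(masks[stored[-1]], masks[i]) > threshold:
            stored.append(i)
    return stored
```

Under these assumptions, a run of near-identical masks adds nothing to memory, which is the temporal redundancy the abstract aims to reduce. The soft modulation gate that prunes the memory bank is a learned component and is not covered by this sketch.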

Comments:	Accepted by ICME 2023. 6 pages, 6 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2304.11840 [cs.CV]
	(or arXiv:2304.11840v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.11840

Submission history

From: Dingwei Zhang
[v1] Mon, 24 Apr 2023 06:19:21 UTC (8,509 KB)
