Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.06983 (cs)

[Submitted on 10 Oct 2022 (v1), last revised 7 Mar 2023 (this version, v4)]

Title:Denoising Masked AutoEncoders Help Robust Classification

Authors:Quanlin Wu, Hang Ye, Yuntian Gu, Huishuai Zhang, Liwei Wang, Di He

View PDF

Abstract:In this paper, we propose a new self-supervised method, which is called Denoising Masked AutoEncoders (DMAE), for learning certified robust classifiers of images. In DMAE, we corrupt each image by adding Gaussian noises to each pixel value and randomly masking several patches. A Transformer-based encoder-decoder model is then trained to reconstruct the original image from the corrupted one. In this learning paradigm, the encoder will learn to capture relevant semantics for the downstream tasks, which is also robust to Gaussian additive noises. We show that the pre-trained encoder can naturally be used as the base classifier in Gaussian smoothed models, where we can analytically compute the certified radius for any data point. Although the proposed method is simple, it yields significant performance improvement in downstream classification tasks. We show that the DMAE ViT-Base model, which just uses 1/10 parameters of the model developed in recent work arXiv:2206.10550, achieves competitive or better certified accuracy in various settings. The DMAE ViT-Large model significantly surpasses all previous results, establishing a new state-of-the-art on ImageNet dataset. We further demonstrate that the pre-trained model has good transferability to the CIFAR-10 dataset, suggesting its wide adaptability. Models and code are available at this https URL.

Comments:	ICLR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2210.06983 [cs.CV]
	(or arXiv:2210.06983v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.06983

Submission history

From: Quanlin Wu [view email]
[v1] Mon, 10 Oct 2022 12:37:59 UTC (3,552 KB)
[v2] Sun, 23 Oct 2022 03:27:14 UTC (3,552 KB)
[v3] Tue, 1 Nov 2022 07:37:28 UTC (3,552 KB)
[v4] Tue, 7 Mar 2023 13:19:13 UTC (7,883 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Denoising Masked AutoEncoders Help Robust Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Denoising Masked AutoEncoders Help Robust Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators