Computer Science > Computer Vision and Pattern Recognition

arXiv:1611.01236 (cs)

[Submitted on 4 Nov 2016 (v1), last revised 11 Feb 2017 (this version, v2)]

Title:Adversarial Machine Learning at Scale

Authors:Alexey Kurakin, Ian Goodfellow, Samy Bengio

View PDF

Abstract:Adversarial examples are malicious inputs designed to fool machine learning models. They often transfer from one model to another, allowing attackers to mount black box attacks without knowledge of the target model's parameters. Adversarial training is the process of explicitly training a model on adversarial examples, in order to make it more robust to attack or to reduce its test error on clean inputs. So far, adversarial training has primarily been applied to small problems. In this research, we apply adversarial training to ImageNet. Our contributions include: (1) recommendations for how to succesfully scale adversarial training to large models and datasets, (2) the observation that adversarial training confers robustness to single-step attack methods, (3) the finding that multi-step attack methods are somewhat less transferable than single-step attack methods, so single-step attacks are the best for mounting black-box attacks, and (4) resolution of a "label leaking" effect that causes adversarially trained models to perform better on adversarial examples than on clean examples, because the adversarial example construction process uses the true label and the model can learn to exploit regularities in the construction process.

Comments:	17 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1611.01236 [cs.CV]
	(or arXiv:1611.01236v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1611.01236

Submission history

From: Alexey Kurakin [view email]
[v1] Fri, 4 Nov 2016 01:11:02 UTC (139 KB)
[v2] Sat, 11 Feb 2017 00:15:46 UTC (140 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Adversarial Machine Learning at Scale

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Adversarial Machine Learning at Scale

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators