Computer Science > Machine Learning

arXiv:1611.02770 (cs)

[Submitted on 8 Nov 2016 (v1), last revised 7 Feb 2017 (this version, v3)]

Title:Delving into Transferable Adversarial Examples and Black-box Attacks

Authors:Yanpei Liu, Xinyun Chen, Chang Liu, Dawn Song

View PDF

Abstract:An intriguing property of deep neural networks is the existence of adversarial examples, which can transfer among different architectures. These transferable adversarial examples may severely hinder deep neural network-based applications. Previous works mostly study the transferability using small scale datasets. In this work, we are the first to conduct an extensive study of the transferability over large models and a large scale dataset, and we are also the first to study the transferability of targeted adversarial examples with their target labels. We study both non-targeted and targeted adversarial examples, and show that while transferable non-targeted adversarial examples are easy to find, targeted adversarial examples generated using existing approaches almost never transfer with their target labels. Therefore, we propose novel ensemble-based approaches to generating transferable adversarial examples. Using such approaches, we observe a large proportion of targeted adversarial examples that are able to transfer with their target labels for the first time. We also present some geometric studies to help understanding the transferable adversarial examples. Finally, we show that the adversarial examples generated using ensemble-based approaches can successfully attack this http URL, which is a black-box image classification system.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1611.02770 [cs.LG]
	(or arXiv:1611.02770v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1611.02770

Submission history

From: Xinyun Chen [view email]
[v1] Tue, 8 Nov 2016 23:25:00 UTC (4,546 KB)
[v2] Mon, 21 Nov 2016 22:28:51 UTC (5,312 KB)
[v3] Tue, 7 Feb 2017 14:24:44 UTC (5,328 KB)

Computer Science > Machine Learning

Title:Delving into Transferable Adversarial Examples and Black-box Attacks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Delving into Transferable Adversarial Examples and Black-box Attacks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators