Computer Science > Machine Learning

arXiv:2206.04316 (cs)

[Submitted on 9 Jun 2022]

Title:Adversarial Noises Are Linearly Separable for (Nearly) Random Neural Networks

Authors:Huishuai Zhang, Da Yu, Yiping Lu, Di He

View PDF

Abstract:Adversarial examples, which are usually generated for specific inputs with a specific model, are ubiquitous for neural networks. In this paper we unveil a surprising property of adversarial noises when they are put together, i.e., adversarial noises crafted by one-step gradient methods are linearly separable if equipped with the corresponding labels. We theoretically prove this property for a two-layer network with randomly initialized entries and the neural tangent kernel setup where the parameters are not far from initialization. The proof idea is to show the label information can be efficiently backpropagated to the input while keeping the linear separability. Our theory and experimental evidence further show that the linear classifier trained with the adversarial noises of the training data can well classify the adversarial noises of the test data, indicating that adversarial noises actually inject a distributional perturbation to the original data distribution. Furthermore, we empirically demonstrate that the adversarial noises may become less linearly separable when the above conditions are compromised while they are still much easier to classify than original features.

Comments:	13 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2206.04316 [cs.LG]
	(or arXiv:2206.04316v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.04316

Submission history

From: Huishuai Zhang [view email]
[v1] Thu, 9 Jun 2022 07:26:46 UTC (111 KB)

Computer Science > Machine Learning

Title:Adversarial Noises Are Linearly Separable for (Nearly) Random Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adversarial Noises Are Linearly Separable for (Nearly) Random Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators