Adversarial Deep Learning with Stackelberg Games

Aneesh Sreevallabh Chivukula⁹,
Xinghao Yang⁹ &
Wei Liu⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1142))

Included in the following conference series:

International Conference on Neural Information Processing

3049 Accesses

Abstract

Deep networks are vulnerable to adversarial attacks from malicious adversaries. Currently, many adversarial learning algorithms are designed to exploit such vulnerabilities in deep networks. These methods focus on attacking and retraining deep networks with adversarial examples to do either feature manipulation or label manipulation or both. In this paper, we propose a new adversarial learning algorithm for finding adversarial manipulations to deep networks. We formulate adversaries who optimize game-theoretic payoff functions on deep networks doing multi-label classifications. We model the interactions between a classifier and an adversary from a game-theoretic perspective and formulate their strategies into a Stackelberg game associated with a two-player problem. Then we design algorithms to solve for the Nash equilibrium, which is a pair of strategies from which there is no incentive for either the classifier or the adversary to deviate. In designing attack scenarios, the adversary’s objective is to deliberately make small changes to test data such that attacked samples are undetected. Our results illustrate that game-theoretic modelling is significantly effective in securing deep learning models against performance vulnerabilities attached by intelligent adversaries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Achieving optimal adversarial accuracy for adversarial deep learning using Stackelberg games

Article 30 August 2022

Applications of game theory in deep learning: a survey

Article 09 February 2022

Evaluation of adversarial attacks sensitivity of classifiers with occluded input data

Article 01 June 2022

Notes

1.
https://pytorch.org/docs/stable/index.html.

References

Biggio, B., Roli, F.: Wild patterns: ten years after the rise of adversarial machine learning. Pattern Recogn. 84, 317–331 (2018)
Article Google Scholar
Brückner, M., Kanzow, C., Scheffer, T.: Static prediction games for adversarial learning problems. J. Mach. Learn. Res. 13(1), 2617–2654 (2012)
MathSciNet MATH Google Scholar
Brückner, M., Scheffer, T.: Stackelberg games for adversarial prediction problems. In: Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) (2011)
Google Scholar
Carlini, N., Wagner, D.: Adversarial examples are not easily detected: bypassing ten detection methods. In: Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security (2017)
Google Scholar
Chivukula, A., Liu, W.: Adversarial deep learning models with multiple adversaries. IEEE Trans. Knowl. Data Eng. 31(6), 1066–1079 (2018)
Article Google Scholar
Chivukula, A.S., Liu, W.: Adversarial learning games with deep learning models. In: Proceedings of 2017 International Joint Conference on Neural Networks (IJCNN) (2017)
Google Scholar
Goodfellow, I., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: Proceedings of International Conference on Learning Representations (2015)
Google Scholar
Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.C.: Improved training of wasserstein GANs. In: Advances in Neural Information Processing Systems 30 (2017)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (2012)
Google Scholar
Liu, W., Chawla, S.: Mining adversarial patterns via regularized loss minimization. Mach. Learn. 81(1), 69–83 (2010)
Article MathSciNet Google Scholar
Lowd, D., Meek, C.: Adversarial learning. In: Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) (2005)
Google Scholar
Moosavi-Dezfooli, S., Fawzi, A., Frossard, P.: Deepfool: a simple and accurate method to fool deep neural networks. In: Proceedings of Conference on Computer Vision and Pattern Recognition CVPR (2016)
Google Scholar
Nisan, N., Roughgarden, T., Tardos, E., Vazirani, V.V.: Algorithmic Game Theory. Cambridge University Press, Cambridge (2007)
Google Scholar
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR (2015)
Google Scholar
Zhou, Y., Kantarcioglu, M., Xi, B.: A survey of game theoretic approach for adversarial machine learning. Wiley Interdisc. Rev. Data Min. Knowl. Discov. 9(3), e1259 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, University of Technology Sydney, Ultimo, Australia
Aneesh Sreevallabh Chivukula, Xinghao Yang & Wei Liu

Authors

Aneesh Sreevallabh Chivukula
View author publications
You can also search for this author in PubMed Google Scholar
Xinghao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aneesh Sreevallabh Chivukula .

Editor information

Editors and Affiliations

Australian National University, Canberra, ACT, Australia
Tom Gedeon
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sreevallabh Chivukula, A., Yang, X., Liu, W. (2019). Adversarial Deep Learning with Stackelberg Games. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Communications in Computer and Information Science, vol 1142. Springer, Cham. https://doi.org/10.1007/978-3-030-36808-1_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-36808-1_1
Published: 05 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36807-4
Online ISBN: 978-3-030-36808-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Adversarial Deep Learning with Stackelberg Games

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Achieving optimal adversarial accuracy for adversarial deep learning using Stackelberg games

Applications of game theory in deep learning: a survey

Evaluation of adversarial attacks sensitivity of classifiers with occluded input data

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Adversarial Deep Learning with Stackelberg Games

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Achieving optimal adversarial accuracy for adversarial deep learning using Stackelberg games

Applications of game theory in deep learning: a survey

Evaluation of adversarial attacks sensitivity of classifiers with occluded input data

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation