
Exploiting Pre-Trained Models and Low-Frequency Preference for Cost-Effective Transfer-based Attack

Online AM: 25 July 2024

Abstract

The transferability of adversarial examples enables practical transfer-based attacks. However, existing theoretical analyses cannot effectively reveal which factors drive cross-model transferability. Moreover, the assumption that the target model's training dataset is available, together with the high cost of training proxy models, limits practicality. We first propose a novel frequency perspective for studying transferability and identify two factors that impair it: an unchangeable intrinsic difference term and a controllable perturbation-related term. To enhance transferability, we formulate an optimization task with a constraint that reduces the impact of the perturbation-related term, and we design an approximate solution that sidesteps the intractability of the Fourier expansion. To address the second issue, we suggest employing pre-trained models, which are freely available, as proxy models. Building on these ideas, we introduce the cost-effective transfer-based attack (CTA), which solves the optimization task over pre-trained models. CTA can be launched against a broad range of applications, at any time, with minimal effort and nearly zero cost to attackers. This makes CTA an effective, versatile, and fundamental tool for attacking and understanding a wide range of target models, regardless of their architecture or training dataset. Extensive experiments demonstrate the strong attack performance of CTA against various models trained on seven black-box domains, highlighting its broad applicability and effectiveness.
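
To make the recipe in the abstract concrete, the sketch below crafts adversarial examples on a freely available ImageNet-pre-trained proxy (here, torchvision's ResNet-50) while restricting the perturbation to low spatial frequencies with an FFT mask, reflecting the low-frequency preference discussed above. This is only an illustration under stated assumptions, not the authors' CTA implementation: the proxy choice, low-pass radius, step size, and perturbation budget are placeholders, and input preprocessing (e.g., ImageNet normalization) is omitted for brevity.

```python
# Illustrative sketch only: a PGD-style attack on a pre-trained proxy with a
# low-frequency restriction on the perturbation. Not the authors' CTA code;
# hyperparameters and the proxy model are assumptions for demonstration.
import torch
import torch.nn.functional as F
import torchvision


def low_pass(delta, radius=16):
    """Keep only low-frequency components of the perturbation via an FFT mask."""
    _, _, h, w = delta.shape
    fy = torch.fft.fftfreq(h, d=1.0 / h, device=delta.device)  # integer frequencies
    fx = torch.fft.fftfreq(w, d=1.0 / w, device=delta.device)
    yy, xx = torch.meshgrid(fy, fx, indexing="ij")
    mask = ((yy ** 2 + xx ** 2).sqrt() <= radius).to(delta.dtype)  # (h, w), broadcast over b, c
    return torch.fft.ifft2(torch.fft.fft2(delta) * mask).real


def craft(proxy, x, y, eps=8 / 255, alpha=2 / 255, steps=10, radius=16):
    """Untargeted attack: maximize the proxy's loss with a low-frequency perturbation."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(proxy(x + delta), y)
        (grad,) = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            delta += alpha * grad.sign()
            delta.copy_(low_pass(delta, radius))        # approximate low-frequency projection
            delta.clamp_(-eps, eps)                     # L_inf budget
            delta.copy_((x + delta).clamp(0, 1) - x)    # keep pixels in [0, 1]
    return (x + delta).detach()


# Any publicly available pre-trained model can serve as the proxy (torchvision >= 0.13 API).
proxy = torchvision.models.resnet50(weights="IMAGENET1K_V1").eval()
x = torch.rand(1, 3, 224, 224)   # placeholder image batch in [0, 1]
y = torch.tensor([207])          # placeholder label
x_adv = craft(proxy, x, y)       # transfer x_adv to the black-box target model
```

Because the L_inf clamp follows the frequency mask, the low-frequency projection is only approximate; the resulting x_adv is then submitted unchanged to the black-box target model.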


Index Terms

  1. Exploiting Pre-Trained Models and Low-Frequency Preference for Cost-Effective Transfer-based Attack


Information

        Published In

ACM Transactions on Knowledge Discovery from Data (Just Accepted)
EISSN: 1556-472X
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Online AM: 25 July 2024
        Accepted: 14 July 2024
        Revised: 09 April 2024
        Received: 15 September 2023


        Author Tags

        1. Deep Neural Networks
        2. Adversarial Examples
        3. Black-box Adversarial Attacks
        4. Transferability

        Qualifiers

        • Research-article


        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

• Total citations: 0
• Total downloads: 128
• Downloads (last 12 months): 128
• Downloads (last 6 weeks): 29
        Reflects downloads up to 16 Nov 2024

