Computer Science > Machine Learning

arXiv:2304.04625 (cs)

[Submitted on 10 Apr 2023]

Title:Reinforcement Learning-Based Black-Box Model Inversion Attacks

Authors:Gyojin Han, Jaehyun Choi, Haeil Lee, Junmo Kim

View PDF

Abstract:Model inversion attacks are a type of privacy attack that reconstructs private data used to train a machine learning model, solely by accessing the model. Recently, white-box model inversion attacks leveraging Generative Adversarial Networks (GANs) to distill knowledge from public datasets have been receiving great attention because of their excellent attack performance. On the other hand, current black-box model inversion attacks that utilize GANs suffer from issues such as being unable to guarantee the completion of the attack process within a predetermined number of query accesses or achieve the same level of performance as white-box attacks. To overcome these limitations, we propose a reinforcement learning-based black-box model inversion attack. We formulate the latent space search as a Markov Decision Process (MDP) problem and solve it with reinforcement learning. Our method utilizes the confidence scores of the generated images to provide rewards to an agent. Finally, the private data can be reconstructed using the latent vectors found by the agent trained in the MDP. The experiment results on various datasets and models demonstrate that our attack successfully recovers the private information of the target model by achieving state-of-the-art attack performance. We emphasize the importance of studies on privacy-preserving machine learning by proposing a more advanced black-box model inversion attack.

Comments:	CVPR 2023, Accepted
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.04625 [cs.LG]
	(or arXiv:2304.04625v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2304.04625

Submission history

From: Gyojin Han [view email]
[v1] Mon, 10 Apr 2023 14:41:16 UTC (329 KB)

Computer Science > Machine Learning

Title:Reinforcement Learning-Based Black-Box Model Inversion Attacks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Reinforcement Learning-Based Black-Box Model Inversion Attacks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators