Black-box Membership Inference Attacks against Fine-tuned Diffusion Models
With the rapid advancement of diffusion-based image-generative models, generated images have become increasingly photorealistic. Moreover, with the release of high-quality pre-trained image-generative models, a growing number of users download these models and fine-tune them on downstream datasets for various image-generation tasks. However, employing such powerful pre-trained models in downstream tasks presents significant privacy leakage risks. In this paper, we propose the first reconstruction-based membership inference attack framework that is tailored to recent diffusion models and operates in the more stringent black-box access setting. Covering four distinct attack scenarios and three types of attacks, the framework can target any popular conditional generator model and achieves high precision, as evidenced by an AUC of $0.95$.
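To make the idea of a reconstruction-based, black-box membership inference attack more concrete, the following is a minimal sketch and not the authors' implementation. It assumes the attacker can only obtain generated images for a query, scores the query by how closely those generations reconstruct it, and evaluates the attack with AUC. The pixel-space distance, the function names `membership_score`, `auc`, and `fake_generate`, and the synthetic data are all assumptions made for illustration; `fake_generate` merely simulates a model that reproduces training members more faithfully than non-members.

```python
import numpy as np

def membership_score(query, generations):
    """Higher score means the generated samples reconstruct the query more closely."""
    q = query.ravel().astype(np.float64)
    g = generations.reshape(len(generations), -1).astype(np.float64)
    # Mean squared distance between the query and each generated sample.
    dists = ((g - q) ** 2).mean(axis=1)
    # Negate the average so that closer reconstructions score higher.
    return -dists.mean()

def auc(member_scores, nonmember_scores):
    """AUC as the probability that a member outscores a non-member (ties count 0.5)."""
    m = np.asarray(member_scores, dtype=np.float64)[:, None]
    n = np.asarray(nonmember_scores, dtype=np.float64)[None, :]
    return float(((m > n).sum() + 0.5 * (m == n).sum()) / (m.size * n.size))

# Synthetic stand-in for black-box queries (assumption, not real model output):
# members are reproduced with small noise, non-members with large noise.
rng = np.random.default_rng(0)
images = rng.random((20, 8, 8))

def fake_generate(x, noise, k=4):
    """Hypothetical generator: returns k noisy copies of x."""
    return x + noise * rng.standard_normal((k,) + x.shape)

member_scores = [membership_score(x, fake_generate(x, 0.05)) for x in images[:10]]
nonmember_scores = [membership_score(x, fake_generate(x, 0.5)) for x in images[10:]]
print("toy AUC:", auc(member_scores, nonmember_scores))
```

In a real attack the simulated generator would be replaced by queries to the fine-tuned model, and the pixel-space distance would likely be replaced by a more robust similarity measure; the AUC on this toy data says nothing about the figure reported in the abstract.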
Yan Pang, Tianhao Wang
Computing technology, computer technology
Yan Pang, Tianhao Wang. Black-box Membership Inference Attacks against Fine-tuned Diffusion Models [EB/OL]. (2023-12-13) [2025-08-02]. https://arxiv.org/abs/2312.08207.