|国家预印本平台
首页|Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

来源:Arxiv_logoArxiv
英文摘要

Generative latent diffusion models have been established as state-of-the-art in data generation. One promising application is generation of realistic synthetic medical imaging data for open data sharing without compromising patient privacy. Despite the promise, the capacity of such models to memorize sensitive patient training data and synthesize samples showing high resemblance to training data samples is relatively unexplored. Here, we assess the memorization capacity of 3D latent diffusion models on photon-counting coronary computed tomography angiography and knee magnetic resonance imaging datasets. To detect potential memorization of training samples, we utilize self-supervised models based on contrastive learning. Our results suggest that such latent diffusion models indeed memorize training data, and there is a dire need for devising strategies to mitigate memorization.

Jannik Kahmann、Salman Ul Hassan Dar、Sandy Engelhardt、Isabelle Ayx、Theano Papavassiliu、Arman Ghanaat、Stefan O. Schoenberg

医学研究方法生物科学现状、生物科学发展计算技术、计算机技术

Jannik Kahmann,Salman Ul Hassan Dar,Sandy Engelhardt,Isabelle Ayx,Theano Papavassiliu,Arman Ghanaat,Stefan O. Schoenberg.Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis[EB/OL].(2023-07-03)[2025-08-02].https://arxiv.org/abs/2307.01148.点此复制

评论