|国家预印本平台
首页|Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

来源:Arxiv_logoArxiv
英文摘要

Personalized generation in T2I diffusion models aims to naturally incorporate individual user preferences into the generation process with minimal user intervention. However, existing studies primarily rely on prompt-level modeling with large-scale models, often leading to inaccurate personalization due to the limited input token capacity of T2I diffusion models. To address these limitations, we propose DrUM, a novel method that integrates user profiling with a transformer-based adapter to enable personalized generation through condition-level modeling in the latent space. DrUM demonstrates strong performance on large-scale datasets and seamlessly integrates with open-source text encoders, making it compatible with widely used foundation T2I models without requiring additional fine-tuning.

Hyungjin Kim、Seokho Ahn、Young-Duk Seo

计算技术、计算机技术

Hyungjin Kim,Seokho Ahn,Young-Duk Seo.Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models[EB/OL].(2025-08-05)[2025-08-23].https://arxiv.org/abs/2508.03481.点此复制

评论