PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples
We present PartComposer, a framework for part-level concept learning from single-image examples that enables text-to-image diffusion models to compose novel objects from meaningful components. Existing methods either struggle to learn fine-grained concepts effectively or require a large dataset as input. To address one-shot data scarcity, we propose a dynamic data synthesis pipeline that generates diverse part compositions. Most importantly, we propose to maximize the mutual information between denoised latents and structured concept codes via a concept predictor, which directly regularizes concept disentanglement and supervises re-composition. Our method achieves strong disentanglement and controllable composition, outperforming subject-level and part-level baselines when mixing concepts from the same or different object categories.
Junyu Liu, R. Kenny Jones, Daniel Ritchie
Computing technology, computer technology
Junyu Liu, R. Kenny Jones, Daniel Ritchie. PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples [EB/OL]. (2025-06-03) [2025-07-02]. https://arxiv.org/abs/2506.03004.