|国家预印本平台
首页|PositionIC: Unified Position and Identity Consistency for Image Customization

PositionIC: Unified Position and Identity Consistency for Image Customization

PositionIC: Unified Position and Identity Consistency for Image Customization

来源:Arxiv_logoArxiv
英文摘要

Recent subject-driven image customization has achieved significant advancements in fidelity, yet fine-grained instance-level spatial control remains elusive, hindering broader real-world application. This limitation is mainly attributed to the absence of scalable datasets that bind identity with precise positional cues. To this end, we introduce PositionIC, a unified framework that enforces position and identity consistency for multi-subject customization. We construct a scalable synthesis pipeline that employs a bidirectional generation paradigm to eliminate subject drift and maintain semantic coherence. On top of these data, we design a lightweight positional modulation operation that decouples spatial embeddings among subjects, enabling independent, accurate placement while preserving visual fidelity. Extensive experiments demonstrate that our approach can achieve precise spatial control while maintaining high consistency in image customization tasks. PositionIC paves the way for controllable, high-fidelity image customization in open-world, multi-entity scenarios and will be released to foster further research.

Junjie Hu、Tianyang Han、Kai Ma、Jialin Gao、Hao Dou、Song Yang、Xianhua He、Jianhui Zhang、Junfeng Luo、Xiaoming Wei、Wenqiang Zhang

计算技术、计算机技术

Junjie Hu,Tianyang Han,Kai Ma,Jialin Gao,Hao Dou,Song Yang,Xianhua He,Jianhui Zhang,Junfeng Luo,Xiaoming Wei,Wenqiang Zhang.PositionIC: Unified Position and Identity Consistency for Image Customization[EB/OL].(2025-08-05)[2025-08-10].https://arxiv.org/abs/2507.13861.点此复制

评论