
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Source: arXiv
Abstract

Text-to-3D generation automates 3D content creation from textual descriptions, which offers transformative potential across various fields. However, existing methods often struggle to align generated content with human preferences, limiting their applicability and flexibility. To address these limitations, in this paper, we propose DreamDPO, an optimization-based framework that integrates human preferences into the 3D generation process through direct preference optimization. Practically, DreamDPO first constructs pairwise examples, then compares their alignment with human preferences using reward or large multimodal models, and lastly optimizes the 3D representation with a preference-driven loss function. By leveraging pairwise comparison to reflect preferences, DreamDPO reduces reliance on precise pointwise quality evaluations while enabling fine-grained controllability through preference-guided optimization. Experiments demonstrate that DreamDPO achieves competitive results, and provides higher-quality and more controllable 3D content compared to existing methods. The code and models will be open-sourced.
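The abstract describes a three-step loop: construct a pairwise example, compare the pair with a reward or large multimodal model, and optimize the 3D representation with a preference-driven loss. The PyTorch sketch below illustrates one such preference-guided update step. It is only a conceptual illustration under stated assumptions: the render and reward_model callables and the logistic form of the preference loss are hypothetical placeholders, not the authors' released implementation.

import torch
import torch.nn.functional as F

def preference_step(params, render, reward_model, prompt, optimizer, beta=1.0):
    # One preference-guided update of a 3D representation (conceptual sketch).
    # params: learnable parameters of the 3D representation (e.g. NeRF or Gaussians)
    # render(params, noise) -> image tensor   (hypothetical callable)
    # reward_model(image, prompt) -> scalar   (hypothetical callable)

    # 1) Construct a pairwise example: two renderings under different random noise.
    img_a = render(params, torch.randn(4))
    img_b = render(params, torch.randn(4))

    # 2) Score both with the reward model, used as a proxy for human preference.
    score_a = reward_model(img_a, prompt)
    score_b = reward_model(img_b, prompt)

    # Choose winner/loser from the detached scores so the choice carries no gradient.
    if score_a.detach() >= score_b.detach():
        score_w, score_l = score_a, score_b
    else:
        score_w, score_l = score_b, score_a

    # 3) Preference-driven loss: raise the winner's score relative to the loser's
    #    (a DPO-like logistic objective on the score gap; the exact form is assumed).
    loss = -F.logsigmoid(beta * (score_w - score_l))

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

In this sketch the pairwise comparison only has to decide which rendering is preferred, rather than assign a calibrated pointwise quality score, which mirrors the robustness argument made in the abstract.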

Yi Yang, Hehe Fan, Tat-Seng Chua, Xiaobo Xia, Fan Ma, Zhenglin Zhou

Subject: Computing Technology; Computer Technology

Yi Yang, Hehe Fan, Tat-Seng Chua, Xiaobo Xia, Fan Ma, Zhenglin Zhou. DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization [EB/OL]. (2025-02-05) [2025-08-02]. https://arxiv.org/abs/2502.04370.
