
Consistent View Synthesis with Pose-Guided Diffusion Models

Source: arXiv

Abstract

Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of camera motion or fail to generate consistent and high-quality novel views under significant camera movement. In this work, we propose a pose-guided diffusion model to generate a consistent long-term video of novel views from a single image. We design an attention layer that uses epipolar lines as constraints to facilitate the association between different viewpoints. Experimental results on synthetic and real-world datasets demonstrate the effectiveness of the proposed diffusion model against state-of-the-art transformer-based and GAN-based approaches.
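The epipolar-attention idea described above can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes pinhole intrinsics K and a known relative pose (R, t) between the target and source views, builds the fundamental matrix, and masks single-head cross-view attention so each target pixel only attends to source pixels near its epipolar line. All function names here are hypothetical.

```python
# Minimal sketch (not the authors' code) of epipolar-constrained cross-view attention.
import torch
import torch.nn.functional as F_nn


def skew(t):
    """Cross-product matrix [t]_x for a 3-vector t."""
    tx, ty, tz = t
    return torch.tensor([[0.0, -tz, ty],
                         [tz, 0.0, -tx],
                         [-ty, tx, 0.0]])


def fundamental_matrix(K, R, t):
    """F = K^-T [t]_x R K^-1 maps target pixels to epipolar lines in the source view."""
    K_inv = torch.linalg.inv(K)
    return K_inv.T @ skew(t) @ R @ K_inv


def epipolar_mask(K, R, t, h, w, threshold=1.5):
    """Boolean (h*w, h*w) mask: True where a source pixel lies within
    `threshold` pixels of the epipolar line induced by a target pixel."""
    F_mat = fundamental_matrix(K, R, t)
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    pix = torch.stack([xs.reshape(-1).float(), ys.reshape(-1).float(),
                       torch.ones(h * w)], dim=-1)      # (N, 3) homogeneous pixels
    lines = pix @ F_mat.T                               # (N, 3) lines a*x + b*y + c = 0
    # Point-to-line distance |l . p| / sqrt(a^2 + b^2) for every (target, source) pair.
    num = (lines @ pix.T).abs()                         # (N, N)
    denom = lines[:, :2].norm(dim=-1, keepdim=True).clamp_min(1e-8)
    return (num / denom) <= threshold


def epipolar_attention(q, k, v, mask):
    """Single-head attention restricted by the epipolar mask.
    q, k, v: (N, d) flattened feature maps; mask: (N, N) boolean.
    Note: for extreme poses a mask row could be empty, which would
    produce NaNs in the softmax; a real implementation should guard this."""
    logits = (q @ k.T) / q.shape[-1] ** 0.5
    logits = logits.masked_fill(~mask, float("-inf"))
    return F_nn.softmax(logits, dim=-1) @ v


if __name__ == "__main__":
    h = w = 16
    K = torch.tensor([[20.0, 0.0, w / 2], [0.0, 20.0, h / 2], [0.0, 0.0, 1.0]])
    R = torch.eye(3)
    t = torch.tensor([0.2, 0.0, 0.0])                   # small sideways camera motion
    mask = epipolar_mask(K, R, t, h, w)
    q, k, v = (torch.randn(h * w, 64) for _ in range(3))
    out = epipolar_attention(q, k, v, mask)
    print(out.shape)  # torch.Size([256, 64])
```

In the paper this constraint is integrated into the attention layers of the diffusion model itself; the standalone masking above only conveys the geometric idea of restricting cross-view association to epipolar lines.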

Changil Kim, Qinbo Li, Hung-Yu Tseng, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf

Subjects: Computing Technology, Computer Technology

Changil Kim, Qinbo Li, Hung-Yu Tseng, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf. Consistent View Synthesis with Pose-Guided Diffusion Models [EB/OL]. (2023-03-30) [2025-08-05]. https://arxiv.org/abs/2303.17598.
