RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
Recent advances in camera-controllable video generation have been constrained by the reliance on static-scene datasets with relative-scale camera annotations, such as RealEstate10K. While these datasets enable basic viewpoint control, they fail to capture dynamic scene interactions and lack metric-scale geometric consistency-critical for synthesizing realistic object motions and precise camera trajectories in complex environments. To bridge this gap, we introduce the first fully open-source, high-resolution dynamic-scene dataset with metric-scale camera annotations in https://github.com/ZGCTroy/RealCam-Vid.
Guangcong Zheng、Teng Li、Xianpan Zhou、Xi Li
计算技术、计算机技术
Guangcong Zheng,Teng Li,Xianpan Zhou,Xi Li.RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements[EB/OL].(2025-04-10)[2025-05-13].https://arxiv.org/abs/2504.08212.点此复制
评论