Dynamic Camera Poses and Where to Find Them

Source: arXiv
Abstract

Annotating camera poses on dynamic Internet videos at scale is critical for advancing fields like realistic video generation and simulation. However, collecting such a dataset is difficult, as most Internet videos are unsuitable for pose estimation. Furthermore, annotating dynamic Internet videos presents significant challenges even for state-of-the-art methods. In this paper, we introduce DynPose-100K, a large-scale dataset of dynamic Internet videos annotated with camera poses. Our collection pipeline addresses filtering using a carefully combined set of task-specific and generalist models. For pose estimation, we combine the latest techniques of point tracking, dynamic masking, and structure-from-motion to achieve improvements over the state-of-the-art approaches. Our analysis and experiments demonstrate that DynPose-100K is both large-scale and diverse across several key attributes, opening up avenues for advancements in various downstream applications.

Chris Rockwell, Joseph Tung, Tsung-Yi Lin, Ming-Yu Liu, David F. Fouhey, Chen-Hsuan Lin

Computing technology, computer technology

Chris Rockwell, Joseph Tung, Tsung-Yi Lin, Ming-Yu Liu, David F. Fouhey, Chen-Hsuan Lin. Dynamic Camera Poses and Where to Find Them [EB/OL]. (2025-04-24) [2025-05-07]. https://arxiv.org/abs/2504.17788.
