RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking
We introduce RGBTrack, a robust framework for real-time 6D pose estimation and tracking that operates solely on RGB data, eliminating the need for depth input in dynamic, precision-demanding object pose tracking tasks. Building on the FoundationPose architecture, we devise a novel binary search strategy combined with a render-and-compare mechanism to efficiently infer depth and generate robust pose hypotheses from true-scale CAD models. To maintain stable tracking in dynamic scenarios, including rapid movements and occlusions, RGBTrack integrates state-of-the-art 2D object tracking (XMem) with a Kalman filter and a state machine for proactive object pose recovery. In addition, RGBTrack's scale recovery module dynamically adapts CAD models of unknown scale using an initial depth estimate, enabling seamless integration with modern generative reconstruction techniques. Extensive evaluations on benchmark datasets demonstrate that RGBTrack's depth-free approach achieves competitive accuracy and real-time performance, making it a promising practical solution for applications in robotics, augmented reality, and computer vision. The source code for our implementation will be made publicly available at https://github.com/GreatenAnoymous/RGBTrack.git.
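The binary search over depth mentioned above can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a pinhole camera and replaces the renderer with a hypothetical `projected_area` function that returns the silhouette area of a sphere of known radius, relying only on the fact that a true-scale object's projected area shrinks monotonically as depth grows. All names and parameters (`projected_area`, `binary_search_depth`, `focal_px`, `radius_m`, the search bounds) are illustrative assumptions.

```python
import math


def projected_area(depth, focal_px, radius_m):
    """Hypothetical stand-in for a renderer: approximate pixel area of the
    silhouette of a sphere with radius `radius_m` at distance `depth`."""
    r_px = focal_px * radius_m / depth
    return math.pi * r_px ** 2


def binary_search_depth(observed_area, focal_px, radius_m,
                        z_min=0.1, z_max=10.0, tol=1e-4):
    """Find the depth at which the rendered silhouette area matches the
    observed mask area. Assumes the area decreases monotonically with depth."""
    lo, hi = z_min, z_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if projected_area(mid, focal_px, radius_m) > observed_area:
            lo = mid  # rendered silhouette too large: the object is farther away
        else:
            hi = mid  # rendered silhouette too small: the object is closer
    return 0.5 * (lo + hi)


# Usage: synthesize an "observed" mask area at a known depth, then recover it.
true_depth = 1.5
observed = projected_area(true_depth, focal_px=600.0, radius_m=0.05)
estimate = binary_search_depth(observed, focal_px=600.0, radius_m=0.05)
```

In a real render-and-compare loop, the comparison would be made between a rendered CAD model and the observed object mask rather than against a closed-form area; the sketch only shows why a monotone size-versus-depth relation makes bisection applicable.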
Teng Guo, Jingjin Yu
Computing technology, computer technology
Teng Guo, Jingjin Yu. RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking [EB/OL]. (2025-06-20) [2025-06-30]. https://arxiv.org/abs/2506.17119.