ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations
ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations
Many existing methods for 3D cuboid annotation of vehicles rely on expensive and carefully calibrated camera-LiDAR or stereo setups, limiting their accessibility for large-scale data collection. We introduce ToosiCubix, a simple yet powerful approach for annotating ground-truth cuboids using only monocular images and intrinsic camera parameters. Our method requires only about 10 user clicks per vehicle, making it highly practical for adding 3D annotations to existing datasets originally collected without specialized equipment. By annotating specific features (e.g., wheels, car badge, symmetries) across different vehicle parts, we accurately estimate each vehicle's position, orientation, and dimensions up to a scale ambiguity (8 DoF). The geometric constraints are formulated as an optimization problem, which we solve using a coordinate descent strategy, alternating between Perspective-n-Points (PnP) and least-squares subproblems. To handle common ambiguities such as scale and unobserved dimensions, we incorporate probabilistic size priors, enabling 9 DoF cuboid placements. We validate our annotations against the KITTI and Cityscapes3D datasets, demonstrating that our method offers a cost-effective and scalable solution for high-quality 3D cuboid annotation.
Behrooz Nasihatkon、Hossein Resani、Amirreza Mehrzadian
计算技术、计算机技术
Behrooz Nasihatkon,Hossein Resani,Amirreza Mehrzadian.ToosiCubix: Monocular 3D Cuboid Labeling via Vehicle Part Annotations[EB/OL].(2025-06-26)[2025-07-16].https://arxiv.org/abs/2506.21358.点此复制
评论