|国家预印本平台
首页|Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection

Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection

Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection

来源:Arxiv_logoArxiv
英文摘要

A crucial yet under-appreciated prerequisite in machine learning solutions for real-applications is data annotation: human annotators are hired to manually label data according to detailed, expert-crafted guidelines. This is often a laborious, tedious, and costly process. To study methods for facilitating data annotation, we introduce a new benchmark AnnoGuide: Auto-Annotation from Annotation Guidelines. It aims to evaluate automated methods for data annotation directly from expert-defined annotation guidelines, eliminating the need for manual labeling. As a case study, we repurpose the well-established nuScenes dataset, commonly used in autonomous driving research, which provides comprehensive annotation guidelines for labeling LiDAR point clouds with 3D cuboids across 18 object classes. These guidelines include a few visual examples and textual descriptions, but no labeled 3D cuboids in LiDAR data, making this a novel task of multi-modal few-shot 3D detection without 3D annotations. The advances of powerful foundation models (FMs) make AnnoGuide especially timely, as FMs offer promising tools to tackle its challenges. We employ a conceptually straightforward pipeline that (1) utilizes open-source FMs for object detection and segmentation in RGB images, (2) projects 2D detections into 3D using known camera poses, and (3) clusters LiDAR points within the frustum of each 2D detection to generate a 3D cuboid. Starting with a non-learned solution that leverages off-the-shelf FMs, we progressively refine key components and achieve significant performance improvements, boosting 3D detection mAP from 12.1 to 21.9! Nevertheless, our results highlight that AnnoGuide remains an open and challenging problem, underscoring the urgent need for developing LiDAR-based FMs. We release our code and models at GitHub: https://annoguide.github.io/annoguide3Dbenchmark

Yechi Ma、Wei Hua、Shu Kong

计算技术、计算机技术

Yechi Ma,Wei Hua,Shu Kong.Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection[EB/OL].(2025-06-03)[2025-06-25].https://arxiv.org/abs/2506.02914.点此复制

评论