Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
Bioacoustic sound event detection (BioSED) is crucial for biodiversity conservation but faces practical challenges during model development and training: limited amounts of annotated data, sparse events, species diversity, and class imbalance. To address these challenges efficiently with a limited labeling budget, we apply the mismatch-first farthest-traversal (MFFT), an active learning method integrating committee voting disagreement and diversity analysis. We also refine an existing BioSED dataset specifically for evaluating active learning algorithms. Experimental results demonstrate that MFFT achieves a mAP of 68% when cold-starting and 71% when warm-starting (which is close to the fully-supervised mAP of 75%) while using only 2.3% of the annotations. Notably, MFFT excels in cold-start scenarios and with rare species, which are critical for monitoring endangered species, demonstrating its practical value.
Tuomas Virtanen、Shiqi Zhang
生物科学理论、生物科学方法生物科学研究方法、生物科学研究技术
Tuomas Virtanen,Shiqi Zhang.Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection[EB/OL].(2025-05-27)[2025-06-14].https://arxiv.org/abs/2505.20956.点此复制
评论