|国家预印本平台
首页|Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization

Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization

Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization

来源:Arxiv_logoArxiv
英文摘要

Sound Event Localization and Detection (SELD) combines the Sound Event Detection (SED) with the corresponding Direction Of Arrival (DOA). Recently, adopted event oriented multi-track methods affect the generality in polyphonic environments due to the limitation of the number of tracks. To enhance the generality in polyphonic environments, we propose Spatial Mapping and Regression Localization for SELD (SMRL-SELD). SMRL-SELD segments the 3D spatial space, mapping it to a 2D plane, and a new regression localization loss is proposed to help the results converge toward the location of the corresponding event. SMRL-SELD is location-oriented, allowing the model to learn event features based on orientation. Thus, the method enables the model to process polyphonic sounds regardless of the number of overlapping events. We conducted experiments on STARSS23 and STARSS22 datasets and our proposed SMRL-SELD outperforms the existing SELD methods in overall evaluation and polyphony environments.

Yunfei Zi、Xueping Zhang、Shengwu Xiong、Yaxiong Chen、Ruilin Yao

无线通信

Yunfei Zi,Xueping Zhang,Shengwu Xiong,Yaxiong Chen,Ruilin Yao.Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization[EB/OL].(2025-04-11)[2025-05-17].https://arxiv.org/abs/2504.08365.点此复制

评论