首页|A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

来源：

英文摘要

Content moderation research has recently made significant advances, but still fails to serve the majority of the world's languages due to the lack of resources, leaving millions of vulnerable users to online hostility. This work presents a large-scale human-annotated multi-task benchmark dataset for abusive language detection in Tigrinya social media with joint annotations for three tasks: abusiveness, sentiment, and topic classification. The dataset comprises 13,717 YouTube comments annotated by nine native speakers, collected from 7,373 videos with a total of over 1.2 billion views across 51 channels. We developed an iterative term clustering approach for effective data selection. Recognizing that around 64% of Tigrinya social media content uses Romanized transliterations rather than native Ge'ez script, our dataset accommodates both writing systems to reflect actual language use. We establish strong baselines across the tasks in the benchmark, while leaving significant challenges for future contributions. Our experiments reveal that small, specialized multi-task models outperform the current frontier models in the low-resource setting, achieving up to 86% accuracy (+7 points) in abusiveness detection. We make the resources publicly available to promote research on online safety.

作者：Fitsum Gaim、Hoyun Song、Huije Lee、Changgeon Ko、Eui Jun Hwang、Jong C. Park

作者单位：

学科分类：闪-含语系（阿非罗-亚细亚语系）计算技术、计算机技术

推荐引用：Fitsum Gaim,Hoyun Song,Huije Lee,Changgeon Ko,Eui Jun Hwang,Jong C. Park.A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings[EB/OL].(2025-05-17)[2025-08-02].https://arxiv.org/abs/2505.12116.点此复制

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

评论