|国家预印本平台
首页|BOISHOMMO: Holistic Approach for Bangla Hate Speech

BOISHOMMO: Holistic Approach for Bangla Hate Speech

BOISHOMMO: Holistic Approach for Bangla Hate Speech

来源:Arxiv_logoArxiv
英文摘要

One of the most alarming issues in digital society is hate speech (HS) on social media. The severity is so high that researchers across the globe are captivated by this domain. A notable amount of work has been conducted to address the identification and alarm system. However, a noticeable gap exists, especially for low-resource languages. Comprehensive datasets are the main problem among the constrained resource languages, such as Bangla. Interestingly, hate speech or any particular speech has no single dimensionality. Similarly, the hate component can simultaneously have multiple abusive attributes, which seems to be missed in the existing datasets. Thus, a multi-label Bangla hate speech dataset named BOISHOMMO has been compiled and evaluated in this work. That includes categories of HS across race, gender, religion, politics, and more. With over two thousand annotated examples, BOISHOMMO provides a nuanced understanding of hate speech in Bangla and highlights the complexities of processing non-Latin scripts. Apart from evaluating with multiple algorithmic approaches, it also highlights the complexities of processing Bangla text and assesses model performance. This unique multi-label approach enriches future hate speech detection and analysis studies for low-resource languages by providing a more nuanced, diverse dataset.

Md Abdullah Al Kafi、Sumit Kumar Banshal、Md Sadman Shakib、Showrov Azam、Tamanna Alam Tabashom

南亚语系(澳斯特罗-亚细亚语系)计算技术、计算机技术

Md Abdullah Al Kafi,Sumit Kumar Banshal,Md Sadman Shakib,Showrov Azam,Tamanna Alam Tabashom.BOISHOMMO: Holistic Approach for Bangla Hate Speech[EB/OL].(2025-04-11)[2025-05-18].https://arxiv.org/abs/2504.08408.点此复制

评论