SCRum-9: Multilingual Stance Classification over Rumours on Social Media
SCRum-9: Multilingual Stance Classification over Rumours on Social Media
We introduce SCRum-9, a multilingual dataset for Rumour Stance Classification, containing 7,516 tweet-reply pairs from X. SCRum-9 goes beyond existing stance classification datasets by covering more languages (9), linking examples to more fact-checked claims (2.1k), and including complex annotations from multiple annotators to account for intra- and inter-annotator variability. Annotations were made by at least three native speakers per language, totalling around 405 hours of annotation and 8,150 dollars in compensation. Experiments on SCRum-9 show that it is a challenging benchmark for both state-of-the-art LLMs (e.g. Deepseek) as well as fine-tuned pre-trained models, motivating future work in this area.
Yue Li、Jake Vasilakes、Zhixue Zhao、Carolina Scarton
语言学
Yue Li,Jake Vasilakes,Zhixue Zhao,Carolina Scarton.SCRum-9: Multilingual Stance Classification over Rumours on Social Media[EB/OL].(2025-05-24)[2025-06-27].https://arxiv.org/abs/2505.18916.点此复制
评论