|国家预印本平台
首页|The Moral Foundations Weibo Corpus

The Moral Foundations Weibo Corpus

The Moral Foundations Weibo Corpus

来源:Arxiv_logoArxiv
英文摘要

Moral sentiments expressed in natural language significantly influence both online and offline environments, shaping behavioral styles and interaction patterns, including social media selfpresentation, cyberbullying, adherence to social norms, and ethical decision-making. To effectively measure moral sentiments in natural language processing texts, it is crucial to utilize large, annotated datasets that provide nuanced understanding for accurate analysis and modeltraining. However, existing corpora, while valuable, often face linguistic limitations. To address this gap in the Chinese language domain,we introduce the Moral Foundation Weibo Corpus. This corpus consists of 25,671 Chinese comments on Weibo, encompassing six diverse topic areas. Each comment is manually annotated by at least three systematically trained annotators based on ten moral categories derived from a grounded theory of morality. To assess annotator reliability, we present the kappa testresults, a gold standard for measuring consistency. Additionally, we apply several the latest large language models to supplement the manual annotations, conducting analytical experiments to compare their performance and report baseline results for moral sentiment classification.

Miaoyan Hu、Baha Ihnaini、Renjie Cao、Jiahan Wei

文化理论信息传播、知识传播计算技术、计算机技术

Miaoyan Hu,Baha Ihnaini,Renjie Cao,Jiahan Wei.The Moral Foundations Weibo Corpus[EB/OL].(2024-11-14)[2025-07-16].https://arxiv.org/abs/2411.09612.点此复制

评论