首页|HateBuffer: Safeguarding Content Moderators' Mental Well-Being through Hate Speech Content Modification

HateBuffer: Safeguarding Content Moderators' Mental Well-Being through Hate Speech Content Modification

来源：

英文摘要

Hate speech remains a persistent and unresolved challenge in online platforms. Content moderators, working on the front lines to review user-generated content and shield viewers from hate speech, often find themselves unprotected from the mental burden as they continuously engage with offensive language. To safeguard moderators' mental well-being, we designed HateBuffer, which anonymizes targets of hate speech, paraphrases offensive expressions into less offensive forms, and shows the original expressions when moderators opt to see them. Our user study with 80 participants consisted of a simulated hate speech moderation task set on a fictional news platform, followed by semi-structured interviews. Although participants rated the hate severity of comments lower while using HateBuffer, contrary to our expectations, they did not experience improved emotion or reduced fatigue compared with the control group. In interviews, however, participants described HateBuffer as an effective buffer against emotional contagion and the normalization of biased opinions in hate speech. Notably, HateBuffer did not compromise moderation accuracy and even contributed to a slight increase in recall. We explore possible explanations for the discrepancy between the perceived benefits of HateBuffer and its measured impact on mental well-being. We also underscore the promise of text-based content modification techniques as tools for a healthier content moderation environment.

作者：Subin Park、Jeonghyun Kim、Jeanne Choi、Joseph Seering、Uichin Lee、Sung-Ju Lee

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Subin Park,Jeonghyun Kim,Jeanne Choi,Joseph Seering,Uichin Lee,Sung-Ju Lee.HateBuffer: Safeguarding Content Moderators' Mental Well-Being through Hate Speech Content Modification[EB/OL].(2025-08-01)[2025-08-11].https://arxiv.org/abs/2508.00439.点此复制

HateBuffer: Safeguarding Content Moderators' Mental Well-Being through Hate Speech Content Modification

HateBuffer: Safeguarding Content Moderators' Mental Well-Being through Hate Speech Content Modification

评论