Dealing with Annotator Disagreement in Hate Speech Classification

ArXiv CS.CL | 2026-06-16 | NEWS | en | Authors: Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu

Details

Source site
ArXiv CS.CL
Authors
Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu
Article type
NEWS
Language
en
Publication date
2026-06-16

Abstract

arXiv:2502.08266v4 (announce type: replace)

Hate speech detection is a crucial task, especially on social media, where harmful content can spread quickly. Collecting social media content (tweets, etc.) to train machine learning models is easy, but detecting and categorizing hate speech is difficult because the task is inherently subjective. This subjectivity leads to frequent disagreement among annotators, particularly for subtle or borderline content. Traditional approaches either discard non-consensus samples or force a "gold standard" through expert adjudication, ignoring valuable information about uncertainty and diverse human perspectives. We examine the largely overlooked problem of annotator disagreement in hate speech classification and evaluate a range of aggregation methods, including majority voting and ordinal strategies (minimum, maximum, and mean), and analyze their impact across binary, 4-class, and 6-class classification tasks.
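The aggregation strategies named in the abstract can be illustrated with a short sketch. This is not the authors' implementation: the abstract does not give the label encoding, tie-breaking, or rounding rules, so the sketch assumes integer ordinal labels (0 = least severe), breaks majority-vote ties toward the more severe label, and rounds the mean half up. The function name `aggregate` and the sample annotations are hypothetical.

```python
from collections import Counter
from statistics import mean
from typing import Sequence


def aggregate(labels: Sequence[int], method: str) -> int:
    """Collapse one item's annotator labels into a single label.

    Assumption: labels are non-negative ordinal integers, 0 = least severe.
    """
    if not labels:
        raise ValueError("need at least one annotation")
    if method == "majority":
        counts = Counter(labels)
        top = max(counts.values())
        # Assumption: ties are broken toward the more severe label.
        return max(label for label, n in counts.items() if n == top)
    if method == "min":
        # Most lenient annotator wins.
        return min(labels)
    if method == "max":
        # Most severe annotator wins.
        return max(labels)
    if method == "mean":
        # Assumption: round half up to the nearest integer label.
        return int(mean(labels) + 0.5)
    raise ValueError(f"unknown method: {method}")


# Hypothetical item rated by four annotators on a 6-class scale (0 to 5).
annotations = [0, 1, 1, 5]
results = {m: aggregate(annotations, m) for m in ("majority", "min", "max", "mean")}
print(results)
```

For this item the four strategies give four different labels (majority 1, minimum 0, maximum 5, mean 2), which shows how much the choice of aggregation method can shape the training target when annotators disagree.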
