Dealing with Annotator Disagreement in Hate Speech Classification

ArXiv CS.CL | 2026-06-16 | NEWS | en | Authors: Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu

Details

Source site: ArXiv CS.CL
Authors: Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu
Article type: NEWS
Language: en
Publication date: 2026-06-16

Abstract

arXiv:2502.08266v4 (announce type: replace)

Hate speech detection is a crucial task, especially on social media, where harmful content can spread quickly. Collecting social media content (tweets, etc.) to train machine learning models is easy, but detecting and categorizing hate speech is difficult because the task is inherently subjective. This subjectivity leads to frequent disagreement among annotators, particularly for subtle or borderline content. Traditional approaches either discard non-consensus samples or force a "gold standard" through expert adjudication, ignoring valuable information about uncertainty and diverse human perspectives. We examine the largely overlooked problem of annotator disagreement in hate speech classification. We evaluate a range of aggregation methods, including majority voting and ordinal strategies (minimum, maximum, and mean), and analyze their impact across binary, 4-class, and 6-class classification tasks.
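The aggregation methods named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes each sample carries a list of integer labels on an ordinal severity scale (0 = least severe), and the function names, the tie-breaking rule for majority voting, and the round-half-up rule for the mean are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def majority_vote(labels):
    """Return the most frequent label.

    Assumption: ties are broken in favor of the smallest label.
    """
    counts = Counter(labels)
    top = max(counts.values())
    return min(label for label, n in counts.items() if n == top)


def aggregate(labels, strategy):
    """Collapse one sample's annotator labels into a single label.

    `labels` is a non-empty list of ints on an assumed ordinal scale.
    `strategy` is one of "majority", "min", "max", "mean".
    """
    if not labels:
        raise ValueError("at least one annotation is required")
    if strategy == "majority":
        return majority_vote(labels)
    if strategy == "min":
        return min(labels)  # least severe judgment wins
    if strategy == "max":
        return max(labels)  # most severe judgment wins
    if strategy == "mean":
        # Assumption: round half up to get back onto the label scale.
        return math.floor(sum(labels) / len(labels) + 0.5)
    raise ValueError(f"unknown strategy: {strategy}")


# Three annotators disagree on one sample.
votes = [1, 1, 3]
results = {s: aggregate(votes, s) for s in ("majority", "min", "max", "mean")}
print(results)  # {'majority': 1, 'min': 1, 'max': 3, 'mean': 2}
```

For the same three votes, the four strategies produce three different labels, which shows why the choice of aggregation method can change the training target for borderline samples.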
