When Does Demographic Information Help? Data and Modeling Regimes for Perspective-Aware Hate Speech Detection 文章

ArXiv CS.CL2026-05-27NEWSen作者: Weibin Cai, Reza Zafarani

摘要

arXiv:2605.27313v1 Announce Type: new Abstract: Demographic information is often used to model annotator perspectives in subjective tasks such as hate speech detection, but its benefit is inconsistent: it improves performance in some settings and behaves as noise in others. This paper asks when demographic features help. We analyze demographic gain as a function of both data split properties and modeling frameworks. For data splits, we measure annotator disagreement, namely how often annotators assign different labels to the same example, along with training size and train-test demographic coverage. We find that demographic gains concentrate in regimes with low training disagreement, high test disagreement, fine-grained ambiguity measurement, sufficient training data, and greater demographic overlap. Motivated by these regimes, we introduce a gated demographic residual model that treats demographics as a selective adjustment to text-only predictions.