From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation 事件
PRODUCT_LAUNCH2026-06-05影响: MEDIUM
From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation arXiv:2606.06266v1 Announce Type: new Abstract: Hate speech detection is inherently subjective: people from different demographic groups perceive the same content very differently. Collecting enough annotations from multiple demographic groups is costly and difficult to scale. Persona-conditioned Large Language Models (models prompted to adopt a specific demographic identity) have been proposed as a way