Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining 文章

ArXiv CS.CL2026-06-18NEWSen作者: Wicaksono Leksono Muhamad, Yunita Sari

详细信息

来源站点
ArXiv CS.CL
作者
Wicaksono Leksono Muhamad, Yunita Sari
文章类型
NEWS
语言
en
发布日期
2026-06-18

摘要

arXiv:2606.18852v1 Announce Type: new Abstract: Classifying implicit hate speech remains a challenge, as intent is often masked through insinuation and context rather than explicit slurs. Prior supervised contrastive approaches improve in-domain detection but can overfit surface cues and struggle to transfer across datasets. We propose ImpSH, a triplet-based framework that aligns posts with implied statements when available and uses context-bounded semi-hard negatives to focus learning on near confusions. We also examine AugSH, which forms positives via data augmentation. In controlled evaluations on IHC, SBIC, and DynaHate with BERT and HateBERT, ImpSH is a viable alternative to standard supervised contrastive baselines and often improves cross-domain performance under matched preprocessing and tuning budgets.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据