Auditing Stance Asymmetry in Generative Explanations 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
Auditing Stance Asymmetry in Generative Explanations arXiv:2605.27988v1 Announce Type: new Abstract: Bias evaluation for language models has made substantial progress on bounded comparisons, such as overt derogation, stereotype association, or label-sensitive differences under controlled substitutions. Open-ended explanations raise a different problem: they guide interpretation by assigning responsibility, legitimacy, context, and grievance. A model can avoid hostile language while making one s
相关产品查看全部 (10)
相关报道查看全部 (1)
Auditing Stance Asymmetry in Generative Explanations
ArXiv CS.CL2026-05-28