Cultural Binding Heads in Language Models 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Cultural Binding Heads in Language Models arXiv:2605.28543v1 Announce Type: cross Abstract: LLMs often default to equal treatment across cultural groups, even though context warrants differentiation: this is a lack of difference awareness. Using mechanistic interpretability and a factorial design on the N4 cultural appropriation benchmark from Wang et al. (2025), we identify 2-3 mid-layer attention heads per model that contribute causally to cultural binding across eight models (four architectu

Cultural Binding Heads in Language Models · 相关公司

N
NISTGOVERNMENT
A
arXivNONPROFIT
A
ARENEGOVERNMENT
A
AnisNONPROFIT
E
EATNONPROFIT
L
LoweCOMPANY
A
ACTNONPROFIT
I
IdentityNONPROFIT
C
CulturaGOVERNMENT