Singular Vectors of Attention Heads Align with Features
Event: PRODUCT_LAUNCH | Date: 2026-05-28 | Impact: MEDIUM
arXiv:2602.13524v2 (announce type: replace-cross)
Abstract: Identifying feature representations in language models is a central task in mechanistic interpretability. Several recent studies have observed that, in some cases, feature representations can be inferred from the singular vectors of attention matrices. However, a sound justification for this phenomenon is lacking. In this paper we address that question, asking: why and when do s
Source: arXiv cs.AI, 2026-05-28