Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context 文章

ArXiv CS.CL2026-05-29NEWSen作者: Yoav Gur-Arieh, Mor Geva, Atticus Geiger

详细信息

来源站点: ArXiv CS.CL
作者: Yoav Gur-Arieh, Mor Geva, Atticus Geiger
文章类型: NEWS
语言: en
发布日期: 2026-05-29

摘要

arXiv:2510.06182v2 Announce Type: replace Abstract: A key component of in-context reasoning is the ability of language models (LMs) to bind entities for later retrieval. For example, an LM might represent "Ann loves pie" by binding "Ann" to "pie", allowing it to later retrieve "Ann" when asked "Who loves pie?" Prior research on short lists of bound entities found strong evidence that LMs implement such retrieval via a positional mechanism, where "Ann" is retrieved based on its position in context. In this work, we find that this mechanism generalizes poorly to more complex settings; as the number of bound entities in context increases, the positional mechanism becomes noisy and unreliable in middle positions. To compensate for this, we find that LMs supplement the positional mechanism with a lexical mechanism (retrieving "Ann" using its bound counterpart "pie") and a reflexive mechanism (retrieving "Ann" through a direct pointer).

Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (3)