Query Circuits: Explaining How Language Models Answer User Prompts 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Query Circuits: Explaining How Language Models Answer User Prompts arXiv:2509.24808v2 Announce Type: replace Abstract: Explaining why a language model produces a particular output requires local, input-level explanations. Existing methods uncover global capability circuits (e.g., indirect object identification), but not why the model answers a specific input query in a particular way. We introduce query circuits, which directly trace the information flow inside a model that maps a specific inpu

Query Circuits: Explaining How Language Models Answer User Prompts · 相关报道