Multi-component Causal Tracing in Large Language Models 事件

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

Multi-component Causal Tracing in Large Language Models arXiv:2606.03085v1 Announce Type: cross Abstract: Causal tracing systematically intervenes on a large language model's (LLM's) internal representations to uncover and quantify the causal pathways linking specific inputs or computations to specific metrics of interest, quantifying the LLM's behavior. Building on previous single-component or single-layer studies, this paper presents a unified framework for causally tracing multiple component