Correcting Gradient-Based Circuit Localization via Interaction-Aware Backpropagation 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Correcting Gradient-Based Circuit Localization via Interaction-Aware Backpropagation arXiv:2505.17630v4 Announce Type: replace Abstract: Circuit localization methods aim to identify the subset of model components responsible for specific behaviors in large language models, enabling detailed mechanistic analysis. Most existing methods assume components act independently and estimate importance by perturbing each component in isolation. However, components in neural networks interact, and ignorin
Correcting Gradient-Based Circuit Localization via Interaction-Aware Backpropagation · 相关人物
暂无数据