Detection vs. Execution: Single-Bucket Probes Miss Half the Mamba-2 State Sink

PRODUCT_LAUNCH · 2026-06-02 · Impact: MEDIUM

arXiv:2606.00930v1 · Announce Type: new

Abstract: Mechanistic interpretability often assumes that probes identifying a representational signature also identify the circuit executing the corresponding computation. We show that this assumption can fail systematically in Mamba-2. Studying the state sink (disproportionate Delta-gate activation on boundary tokens, analogous to the attention sink), we find that single-bucket
