Don't Read Everything: A Curvature-Conditioned Query for Linear Attention 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Don't Read Everything: A Curvature-Conditioned Query for Linear Attention arXiv:2606.01294v1 Announce Type: new Abstract: Linear attention reduces the quadratic cost of softmax attention by maintaining a recurrent fast-weight state, but it consistently lags on in-context retrieval and long-context tasks. Existing remedies act on the write side of memory through gating, delta updates, or kernel feature maps, but the read step is left unchanged: every past key contributes additively to the output