Exact Linear Attention 事件
PRODUCT_LAUNCH2026-06-06影响: MEDIUM
Exact Linear Attention arXiv:2605.18848v3 Announce Type: replace-cross Abstract: This paper introduces Exact Linear Attention (ELA), a mechanism that achieves linear computational complexity for Transformer attention by exploiting the exact decomposition property of kernel functions, thereby eliminating approximation error. We identify and address two key limitations of prior linear attention -- gradient explosion and token attention dilution -- by imposing kernel constraints that ensure non-ne
相关产品查看全部 (10)
相关报道查看全部 (1)
Exact Linear Attention
ArXiv CS.AI2026-06-06