UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training 文章

ArXiv CS.CL2026-05-28NEWSen作者: Keqi Deng, Shaoshi Ling, Ruchao Fan, Jinyu Li

UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training · 相关技术