LVSA: Training-Free Sparse Attention for Long Video Diffusion (Event)

BREAKTHROUGH | 2026-06-01 | Impact: HIGH

arXiv:2605.31057v1 | Announce Type: new

Abstract: Dense self-attention is the compute and quality bottleneck of long-video diffusion inference: cost grows quadratically with sequence length, and beyond the training horizon the model converges to near-static output, i.e., "frozen", repetitive video. State-of-the-art approaches are either too costly (e.g., they require retraining) or fail to satisfy both performance and quality obj