OmniMem: Scalable and Adaptive Memory Retrieval for Long Video Generation 文章

ArXiv CS.CV2026-06-01NEWSen作者: Lin Zhao, Yushu Wu, Yifan Gong, Yanzhi Wang, Pu Zhao

摘要

arXiv:2605.30519v1 Announce Type: new Abstract: Autoregressive (AR) video generation extends videos by producing latent chunks sequentially, but scaling to long videos requires repeated access to a growing historical KV cache. Existing methods reduce this cost by truncating the KV cache or compressing it into implicit memory, but both lose explicit access to query-relevant historical details. We propose OmniMem, an explicit full-range memory retrieval framework that performs sparse KV retrieval over the historical cache. To make this practical for chunk-based AR video generation, OmniMem addresses two issues: (i) local bias in sparse KV selection and (ii) Union Explosion in memory access. Adaptive Window Exclusion removes local-window blocks from the selection candidates when sufficient long-range history is available, preserving the sparse budget for informative long-range retrieval.

相关公司

暂无数据

相关人物

暂无数据