GHOST: Geometry-Hierarchical Online Streaming Token Eviction for Efficient 3D Reconstruction 文章

ArXiv CS.CV2026-05-29NEWSen作者: Leyang Chen, Junyi Wu, Zhiteng Li, Yulun Zhang

摘要

arXiv:2605.15852v2 Announce Type: replace Abstract: Streaming 3D reconstruction from long monocular video sequences requires maintaining a key-value (KV) cache that grows linearly with sequence length, creating a severe memory bottleneck. Existing approaches either truncate the cache to a fixed set of anchor frames, leading to reconstruction quality degradation, or rely on attention-score heuristics that are agnostic to 3D scene structure, failing to preserve geometrically valuable tokens. To address these problems, we present GHOST (Geometry-Hierarchical Online Streaming Token Eviction), a training-free KV cache management framework that exploits the model's own 3D geometry outputs to evict redundant tokens online. GHOST introduces three mutually reinforcing innovations: a hierarchical dual-level importance scoring scheme, a privilege mechanism that protects special tokens from eviction, and a cosine-similarity-guided layer-wise budget allocation.

相关公司

暂无数据

相关人物

暂无数据