Knowledge-Preserved Model Tuning in Null-Space for Robust Spatio-Temporal Video Grounding 文章

ArXiv CS.CV2026-06-03NEWSen作者: Haoxuan Chen, Xianqin Liu, Jian-Fang Hu

摘要

arXiv:2606.03539v1 Announce Type: new Abstract: Spatio-Temporal Video Grounding aims to localize object tubes based on textual queries. While recent methods have achieved remarkable success, they mainly focus on high-quality(HQ) inputs, neglecting the widespread presence of low-quality(LQ) videos in real-world scenarios. Although tuning methods like LoRA can adapt to degraded inputs, they inevitably disrupt pre-trained knowledge. To address this, we propose Null-Space Tuning (NST). This framework exploits the geometric property that adding vectors within the null-space of frozen weights to the layer input does not affect the output. Leveraging this, NST injects learnable residuals into input features that can be selectively invisible to the pre-trained backbone.

Knowledge-Preserved Model Tuning in Null-Space for Robust Spatio-Temporal Video Grounding 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (2)