Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing 文章

ArXiv CS.CV2026-05-28NEWSen作者: Lin Liu, Zhihan Xiao, Haohang Xu, Rong Cong, Zhibo Zhang, Xiaopeng Zhang, Qi Tian

摘要

arXiv:2605.23192v2 Announce Type: replace Abstract: Video editing has recently achieved remarkable progress with diffusion-based generative models, enabling diverse object-level manipulations from natural language instructions. However, existing methods often struggle under occlusion, viewpoint changes, and fast object motion, where unreliable visual observations lead to inaccurate localization, temporal flickering, and inconsistent edits. In this work, we identify the absence of reliable visual anchors as a fundamental bottleneck in occlusion-robust video editing. To address this issue, we propose an occlusion-aware physics-semantic keyframe selection framework that automatically identifies an optimal anchor frame for downstream editing.

Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术