Beyond Rigid: Benchmarking Non-Rigid Video Editing 文章

ArXiv CS.CV2026-06-02NEWSen作者: Bingzheng Qu, Xuefeng Bai, Kehai Chen, Min Zhang

摘要

arXiv:2601.18340v2 Announce Type: replace Abstract: As video generation models are increasingly expected to manipulate physical dynamics, there is a growing need to move evaluation beyond appearance fidelity and semantic alignment. Non-rigid video editing offers a uniquely revealing testbed, where distinct materials impose distinct physical constraints. In this paper, we introduce NRVBench, a diagnostic benchmark for non-rigid video editing, where the task is to modify deformable motion while preserving irrelevant regions and maintaining material-specific plausibility. NRVBench contains 180 curated videos across six physics-grounded categories, 2,340 fine-grained editing instructions, 360 multiple-choice questions, and pixel-accurate masks. We further propose NRVE-Acc, a structured VLM-based protocol that decomposes editing success into instruction following, material-aware deformation plausibility, and temporal coherence with motion cues.

Beyond Rigid: Benchmarking Non-Rigid Video Editing 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (7)

相关技术查看全部 (2)