VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization 事件

BREAKTHROUGH2026-06-02影响: HIGH

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization arXiv:2606.02564v1 Announce Type: new Abstract: The recent "Reasoning with Video" paradigm utilizes Video Generation Models (VGMs) to generate temporally coherent visual trajectories to complete reasoning tasks. Although state-of-the-art VGMs excel at visual quality, they often struggle to understand and follow task-specific rules, leading to logical failures across diverse reasoning scenarios. Existing efforts try t