VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization 事件
BREAKTHROUGH2026-06-02影响: HIGH
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization arXiv:2606.02564v1 Announce Type: new Abstract: The recent "Reasoning with Video" paradigm utilizes Video Generation Models (VGMs) to generate temporally coherent visual trajectories to complete reasoning tasks. Although state-of-the-art VGMs excel at visual quality, they often struggle to understand and follow task-specific rules, leading to logical failures across diverse reasoning scenarios. Existing efforts try t
相关产品查看全部 (10)
相关报道查看全部 (1)
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization
ArXiv CS.CV2026-06-02