Video Reasoning without Training 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Video Reasoning without Training arXiv:2510.17045v2 Announce Type: replace Abstract: Video reasoning using Large Multimodal Models (LMMs) relies on costly reinforcement learning (RL) and verbose chain-of-thought, resulting in substantial computational overhead during both training and inference. Moreover, the mechanisms that control the thinking process in these reasoning models are very limited. In this paper, we use the entropy of the model's output distribution as a signal to study and guide