Video Reasoning without Training 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Video Reasoning without Training arXiv:2510.17045v2 Announce Type: replace Abstract: Video reasoning using Large Multimodal Models (LMMs) relies on costly reinforcement learning (RL) and verbose chain-of-thought, resulting in substantial computational overhead during both training and inference. Moreover, the mechanisms that control the thinking process in these reasoning models are very limited. In this paper, we use the entropy of the model's output distribution as a signal to study and guide
相关产品查看全部 (10)
相关报道查看全部 (1)
Video Reasoning without Training
ArXiv CS.CV2026-06-02