IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning
Event: PRODUCT_LAUNCH, 2026-05-26. Impact: MEDIUM
arXiv:2605.23997v1 Announce Type: new
Abstract: Multimodal large language models trained via reinforcement learning (RL) have demonstrated remarkable capabilities in complex visual reasoning tasks, yet they remain limited in long-horizon multimodal scenarios, often suffering from visual hallucination and logical errors. Current methods typically pre-encode high-dimensional visual scenes into discrete text