IPR-1: Interactive Physical Reasoner 文章

ArXiv CS.CV2026-05-26NEWSen作者: Mingyu Zhang, Lifeng Zhuo, Tianxi Tan, Guocan Xie, Xian Nie, Yan Li, Renjie Zhao, Zizhu He, Ziyu Wang, Jiting Cai, Yong-Lu Li

查看原文 →

关系图谱

摘要

arXiv:2511.15407v4 Announce Type: replace-cross Abstract: Humans learn by observing, interacting with environments, and internalizing physics and causality. Here, we aim to ask whether an agent can similarly acquire human-like reasoning from interaction and keep improving with more experience. To study this, we introduce a Game-to-Unseen (G2U) benchmark of 1,000+ heterogeneous games that exhibit significant visual domain gaps. Existing approaches, including VLMs and world models, struggle to capture underlying physics and causality since they are not focused on core mechanisms and overfit to visual details. VLM/VLA agents reason but lack look-ahead in interactive settings, while world models imagine but imitate visual patterns rather than analyze physics and causality.

IPR-1: Interactive Physical Reasoner 文章

摘要

相关事件查看全部 (2)

相关公司查看全部 (4)

相关人物

相关产品查看全部 (10)

相关技术查看全部 (18)