Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators 事件

PRODUCT_LAUNCH2026-06-05影响: MEDIUM

Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators arXiv:2606.06476v1 Announce Type: new Abstract: While Vision-Language Models (VLMs) have shown strong visual reasoning capabilities, their spatial reasoning abilities remain largely constrained to the observed images and text-oriented chain-of-thought. They often struggle to infer unobserved layouts, maintain cross-view consistency, and reason from alternative viewpoints when only limited egocentric observations a