WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models arXiv:2605.25077v1 Announce Type: new Abstract: Recent video-based world models have made pixel-space environments interactive at the camera level: users can navigate viewpoints while the model generates coherent visual continuations. Yet their action spaces remain incomplete: users can move the camera, but cannot act on individual objects. Since real-world interaction is inherently object-centric, such