Olaf-World: Orienting Latent Actions for Video World Modeling | Event

PRODUCT_LAUNCH | 2026-05-27 | Impact: MEDIUM

arXiv:2602.10104v2 | Announce Type: replace

Abstract: Scaling action-controllable world models is limited by the scarcity of action labels. While latent action learning promises to extract control interfaces from unlabeled video, learned latents often fail to transfer across contexts: they entangle scene-specific cues and lack a shared coordinate system. This occurs because standard objectives operate only within each clip, providing n