PatchWorld: Gradient-Free Optimization of Executable World Models
Event: PRODUCT_LAUNCH | Date: 2026-06-01 | Impact: MEDIUM
arXiv:2605.30880v1. Announce Type: new.
Abstract: Text-agent environments are typically modeled as partially observable Markov decision processes (POMDPs), assuming that the simulator's latent state and transition dynamics are hidden from the agent. Yet little work has examined whether executable code can be induced to serve as a world model for prediction and planning under partial observability. We introduce PatchWorld, a gradien