COMAP: Co-Evolving World Models and Agent Policies for LLM Agents

arXiv cs.CL | 2026-06-02 | Authors: Youwei Liu, Jian Wang, Hanlin Wang, Wenjie Li

Abstract

arXiv:2606.02372v1 (announce type: cross)

Equipping language agents with world models enables them to anticipate environment dynamics and evaluate candidate actions before execution. However, existing textual world models are typically fixed after training, preventing them from adapting to the on-policy state-action distributions induced by an evolving agent. Meanwhile, agent-improvement methods often rely on external rewards or verifiers, limiting their applicability in realistic interactive environments. In this paper, we propose COMAP, a novel framework that co-evolves textual world models and agent policies through closed-loop interaction. At each decision step, the world model predicts future state feedback for candidate actions, and the agent performs future-aware reflection by estimating the reliability of this feedback and refining its action accordingly.
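
The decision step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say how reliability is estimated or how actions are refined, so every name here (`future_aware_step`, `toy_world_model`, `toy_reliability`, `toy_utility`, `min_reliability`) is hypothetical, and the "refinement" is simplified to re-ranking candidates by reliability-weighted utility. In a real system the world model and the agent would be LLM calls rather than lookup tables.

```python
from typing import Callable, List, Tuple

# (action, predicted feedback, estimated reliability) for each candidate
Trace = List[Tuple[str, str, float]]


def future_aware_step(
    state: str,
    candidates: List[str],
    world_model: Callable[[str, str], str],
    reliability: Callable[[str, str, str], float],
    utility: Callable[[str], float],
    min_reliability: float = 0.5,  # assumed cutoff, not taken from the paper
) -> Tuple[str, Trace]:
    """Pick an action using predicted feedback, discounted by how much it is trusted."""
    if not candidates:
        raise ValueError("at least one candidate action is required")

    trace: Trace = []
    best_action, best_score = candidates[0], float("-inf")
    for action in candidates:
        # 1. The world model predicts the feedback the environment would return.
        predicted = world_model(state, action)
        # 2. The agent estimates how reliable that prediction is (0.0 to 1.0).
        trust = reliability(state, action, predicted)
        # 3. Untrusted predictions are ignored (neutral score of 0.0);
        #    trusted ones contribute their utility scaled by the trust level.
        score = trust * utility(predicted) if trust >= min_reliability else 0.0
        trace.append((action, predicted, trust))
        if score > best_score:
            best_action, best_score = action, score
    return best_action, trace


# Toy stand-ins for the learned components (purely illustrative).
def toy_world_model(state: str, action: str) -> str:
    table = {"open drawer": "You see a key.", "go north": "The door is locked."}
    return table.get(action, "Nothing happens.")


def toy_reliability(state: str, action: str, predicted: str) -> float:
    # Pretend the generic fallback prediction is untrustworthy.
    return 0.2 if predicted == "Nothing happens." else 0.9


def toy_utility(predicted: str) -> float:
    return 1.0 if "key" in predicted else -1.0


if __name__ == "__main__":
    action, trace = future_aware_step(
        "You are in a locked room.",
        ["go north", "open drawer", "dance"],
        toy_world_model,
        toy_reliability,
        toy_utility,
    )
    print(action)
    for row in trace:
        print(row)
```

The returned trace records each candidate with its predicted feedback and trust estimate. In a co-evolution setting, records like these, paired with the feedback actually observed from the environment, are the kind of on-policy data that could be used to update the world model, though the abstract does not specify the training procedure.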
