Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics 文章

ArXiv CS.AI2026-06-04NEWSen作者: Jashaswimalya Acharjee, Balaraman Ravindran

详细信息

来源站点
ArXiv CS.AI
作者
Jashaswimalya Acharjee, Balaraman Ravindran
文章类型
NEWS
语言
en
发布日期
2026-06-04

摘要

arXiv:2602.12643v2 Announce Type: replace-cross Abstract: We present Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that unifies the efficiency of model-free methods with the representational strengths of model-based approaches, without incurring planning overhead. By embedding state-action pairs into a latent space in which the true value function is approximately linear, our method supports a single set of hyperparameters across diverse domains -- from continuous control with low-dimensional and pixel inputs to high-dimensional Atari games. We prove that, under mild conditions, the fixed point of our embedding-based temporal-difference updates coincides with that of a corresponding linear model-based value expansion, and we derive explicit error bounds relating embedding fidelity to value approximation quality.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据