Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics 文章

ArXiv CS.AI2026-06-04NEWSen作者: Jashaswimalya Acharjee, Balaraman Ravindran

详细信息

来源站点: ArXiv CS.AI
作者: Jashaswimalya Acharjee, Balaraman Ravindran
文章类型: NEWS
语言: en
发布日期: 2026-06-04

摘要

arXiv:2602.12643v2 Announce Type: replace-cross Abstract: We present Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that unifies the efficiency of model-free methods with the representational strengths of model-based approaches, without incurring planning overhead. By embedding state-action pairs into a latent space in which the true value function is approximately linear, our method supports a single set of hyperparameters across diverse domains -- from continuous control with low-dimensional and pixel inputs to high-dimensional Atari games. We prove that, under mild conditions, the fixed point of our embedding-based temporal-difference updates coincides with that of a corresponding linear model-based value expansion, and we derive explicit error bounds relating embedding fidelity to value approximation quality.

Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (1)