Representation Learning Enables Scalable Multitask Deep Reinforcement Learning 文章

ArXiv CS.AI2026-06-06NEWSen作者: Johan Obando-Ceron, Lu Li, Scott Fujimoto, Pierre-Luc Bacon, Aaron Courville, Pablo Samuel Castro

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.AI
作者: Johan Obando-Ceron, Lu Li, Scott Fujimoto, Pierre-Luc Bacon, Aaron Courville, Pablo Samuel Castro
文章类型: NEWS
语言: en
发布日期: 2026-06-06

原文

摘要

arXiv:2606.05555v1 Announce Type: cross Abstract: Scaling reinforcement learning (RL) to diverse multitask settings remains a central challenge. While recent advances in model-based RL achieve strong performance, they rely on planning and complex training pipelines, making it unclear which components are essential for scalability. We revisit this question and argue that the primary driver of scalable multitask RL is not model-based control, but \emph{representation learning}. In particular, we show that combining predictive, model-based representations with high-capacity value function approximation is sufficient to achieve strong performance, even without planning. We evaluate a simple model-free algorithm, MR.Q, coupled with auxiliary predictive objectives into a scalable actor-critic architecture.

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (6)