Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path 论文

2007Machine Learning引用 249

Markov Chains and Monte Carlo MethodsMachine Learning and AlgorithmsReinforcement Learning in Robotics

机器人 Machine Learning and Algorithms Reinforcement Learning in Robotics Markov Chains and Monte Carlo Methods

相关技术:Markov Chains and Monte Carlo Methods Machine Learning and Algorithms Reinforcement Learning in Robotics

3

作者

3

相关技术

0

相关事件

0

相关文章

作者查看全部 (3)

Rémi Munos

Csaba Szepesvári

András Antos

相关技术查看全部 (3)

Markov Chains and Monte Carlo Methods Machine Learning and Algorithms Reinforcement Learning in Robotics

相关事件

暂无数据

相关文章

暂无数据