Finite-Time Bounds for Fitted Value Iteration · Paper

2008 · Cited by 263
Markov Chains and Monte Carlo Methods · Machine Learning and Algorithms · Reinforcement Learning in Robotics

Finite-Time Bounds for Fitted Value Iteration · Related Techniques