Approximate policy iteration: a survey and some new methods (Paper)

2011, Journal of Control Theory and Applications, cited by 264
Reinforcement Learning in Robotics; Adaptive Dynamic Programming Control; Model Reduction and Neural Networks