Learning continuous control policies by stochastic value gradients (paper)

2015 · arXiv (Cornell University) · Cited by 286
Reinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks

Learning continuous control policies by stochastic value gradients · Related techniques