Reinforcement learning improves behaviour from evaluative feedback 论文

2015Nature引用 367
Reinforcement Learning in RoboticsData Stream Mining TechniquesAdvanced Bandit Algorithms Research