Reinforcement learning improves behaviour from evaluative feedback 论文

2015Nature引用 367
Reinforcement Learning in RoboticsData Stream Mining TechniquesAdvanced Bandit Algorithms Research

Reinforcement learning improves behaviour from evaluative feedback · 作者