Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance 论文

2006引用 250

Reinforcement Learning in RoboticsRobot Manipulation and LearningMachine Learning and Algorithms

机器人 Machine Learning and Algorithms Reinforcement Learning in Robotics Robot Manipulation and Learning

作者

摘要

As robots become a mass consumer product, they will need to learn new skills by interacting with typical hu-man users. Past approaches have adapted reinforcement learning (RL) to accept a human reward signal; how-ever, we question the implicit assumption that people shall only want to give the learner feedback on its past actions. We present findings from a human user study showing that people use the reward signal not only to provide feedback about past actions, but also to pro-vide future directed rewards to guide subsequent ac-tions. Given this, we made specific modifications to the simulated RL robot to incorporate guidance. We then analyze and evaluate its learning performance in a second user study, and we report significant improve-ments on several measures. This work demonstrates the importance of understanding the human-teacher/robot-learner system as a whole in order to design algorithms that support how people want to teach while simultane-ously improving the robot’s learning performance.

作者查看全部 (2)

Cynthia Breazeal

Andrea L. Thomaz

Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance 论文

摘要

作者查看全部 (2)

相关技术查看全部 (3)

相关事件

相关文章