Robot learning from demonstration by constructing skill trees 论文

2011The International Journal of Robotics Research引用 300
Robot Manipulation and LearningReinforcement Learning in RoboticsAI-based Problem Solving and Planning

摘要

We describe CST, an online algorithm for constructing skill trees from demonstration trajectories. CST segments a demonstration trajectory into a chain of component skills, where each skill has a goal and is assigned a suitable abstraction from an abstraction library. These properties permit skills to be improved efficiently using a policy learning algorithm. Chains from multiple demonstration trajectories are merged into a skill tree. We show that CST can be used to acquire skills from human demonstration in a dynamic continuous domain, and from both expert demonstration and learned control sequences on the uBot-5 mobile manipulator.