Maximum Entropy Inverse Reinforcement Learning 论文

2018Research Showcase @ Carnegie Mellon University (Carnegie Mellon University)引用 2055

Reinforcement Learning in RoboticsDiffusion and Search DynamicsAutonomous Vehicle Technology and Safety

机器人 Reinforcement Learning in Robotics Diffusion and Search Dynamics Autonomous Vehicle Technology and Safety

作者

摘要

Recent research has shown the beneﬁt of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of re- covering a utility function that makes the behavior induced by a near-optimal policy closely mimic demonstrated behavior. In this work, we develop a probabilistic approach based on the principle of maximum entropy. Our approach provides a well-deﬁned, globally normalized distribution over decision sequences, while providing the same performance guarantees as existing methods. We develop our technique in the context of modeling real world navigation and driving behaviors where collected data is inherently noisy and imperfect. Our probabilistic approach enables modeling of route preferences as well as a powerful new approach to inferring destinations and routes based on partial trajectories.

作者查看全部 (4)

Anind K. Dey

J. Andrew Bagnell

Andrew L. Maas

Brian D. Ziebart

Maximum Entropy Inverse Reinforcement Learning 论文

摘要

作者查看全部 (4)

相关技术查看全部 (2)

相关事件

相关文章