Information theoretic MPC for model-based reinforcement learning 论文

2017引用 440

Advanced Control Systems OptimizationReinforcement Learning in RoboticsControl Systems and Identification

机器人 Control Systems and Identification Reinforcement Learning in Robotics Advanced Control Systems Optimization

作者

摘要

We introduce an information theoretic model predictive control (MPC) algorithm capable of handling complex cost criteria and general nonlinear dynamics. The generality of the approach makes it possible to use multi-layer neural networks as dynamics models, which we incorporate into our MPC algorithm in order to solve model-based reinforcement learning tasks. We test the algorithm in simulation on a cart-pole swing up and quadrotor navigation task, as well as on actual hardware in an aggressive driving task. Empirical results demonstrate that the algorithm is capable of achieving a high level of performance and does so only utilizing data collected from the system.

作者查看全部 (7)

Evangelos A. Theodorou

Byron Boots

James M. Rehg

Paul Drews

Information theoretic MPC for model-based reinforcement learning 论文

摘要

作者查看全部 (7)

相关技术查看全部 (2)

相关事件

相关文章