Adaptive linear quadratic control using policy iteration (paper)

2005 · Cited by 416

Robotics

Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Iterative Learning Control Systems

Abstract

In this paper we present the stability and convergence results for dynamic programming-based reinforcement learning applied to linear quadratic regulation (LQR). The specific algorithm we analyze is based on Q-learning and it is proven to converge to an optimal controller provided that the underlying system is controllable and a particular signal vector is persistently excited. This is the first convergence result for DP-based reinforcement learning algorithms for a continuous problem.
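The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's exact algorithm: it assumes a deterministic system x' = A x + B u with stage cost x'Ex + u'Fu, no discounting, the sign convention u = -K x, and a batch least-squares fit where an online recursive estimator could be used instead. The names `quad_features`, `theta_to_H`, and `q_policy_iteration`, and the example system, are hypothetical. Exploration noise on the input plays the role of the persistent-excitation condition, and the initial gain `K0` is assumed to be stabilizing.

```python
import numpy as np

def quad_features(z):
    # Upper-triangular entries of z z^T; off-diagonals are doubled so that
    # quad_features(z) @ theta == z^T H z when theta is the upper triangle of H.
    outer = np.outer(z, z)
    i, j = np.triu_indices(len(z))
    return np.where(i == j, 1.0, 2.0) * outer[i, j]

def theta_to_H(theta, n):
    # Rebuild the symmetric matrix H from its upper-triangular entries.
    H = np.zeros((n, n))
    i, j = np.triu_indices(n)
    H[i, j] = theta
    H[j, i] = theta
    return H

def q_policy_iteration(A, B, E, F, K0, n_iters=10, n_samples=200,
                       noise=1.0, seed=0):
    # Assumed setup: the learner only sees (x, u, cost, x_next) samples;
    # A and B are used here solely to simulate the system.
    rng = np.random.default_rng(seed)
    nx, nu = B.shape
    K = K0.copy()
    for _ in range(n_iters):
        rows, costs = [], []
        x = rng.standard_normal(nx)
        for _ in range(n_samples):
            # Exploration noise keeps the regression data sufficiently rich.
            u = -K @ x + noise * rng.standard_normal(nu)
            x_next = A @ x + B @ u
            z = np.concatenate([x, u])
            z_next = np.concatenate([x_next, -K @ x_next])
            # Bellman equation for the current policy:
            # Q(x, u) - Q(x_next, -K x_next) = cost(x, u)
            rows.append(quad_features(z) - quad_features(z_next))
            costs.append(x @ E @ x + u @ F @ u)
            x = x_next
        # Policy evaluation: least-squares estimate of the Q-function matrix H.
        theta, *_ = np.linalg.lstsq(np.array(rows), np.array(costs), rcond=None)
        H = theta_to_H(theta, nx + nu)
        # Policy improvement: minimize z^T H z over u, giving u = -H_uu^{-1} H_ux x.
        K = np.linalg.solve(H[nx:, nx:], H[nx:, :nx])
    return K

# Hypothetical example: a discretized double integrator with a stabilizing K0.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
E = np.eye(2)
F = np.array([[1.0]])
K0 = np.array([[1.0, 2.0]])
K_learned = q_policy_iteration(A, B, E, F, K0)
print(K_learned)
```

Under these assumptions the learned gain can be compared against the gain obtained by iterating the discrete-time Riccati equation with the known A and B; the point of the Q-function parameterization is that the improvement step itself needs only the estimated H, not the model.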

Authors

Andrew G. Barto

B. Erik Ydstie

Steven J. Bradtke
