Dynamic Programming and Optimal Control 3rd Edition, Volume II 论文

2010引用 220

Adaptive Dynamic Programming ControlReinforcement Learning in RoboticsSmart Grid Energy Management

机器人 Reinforcement Learning in Robotics Smart Grid Energy Management Adaptive Dynamic Programming Control

作者

摘要

This is an updated version of the research-oriented Chapter 6 on Approximate Dynamic Programming. It will be periodically updated as new research becomes available, and will replace the current Chapter 6 in the book’s next printing. In addition to editorial revisions, rearrangements, and new exercises, the chapter includes an account of new research, which is collected mostly in Sections 6.3 and 6.8. Furthermore, a lot of new material has been added, such as an account of post-decision state simplifications (Section 6.1), regression-based TD methods (Section 6.3), feature scaling (Section 6.3), policy oscillations (Section 6.3), λ-policy iteration and exploration enhanced TD methods, aggregation methods (Section 6.4), new Q-learning algorithms (Section 6.5), and Monte Carlo linear algebra (Section 6.8). This chapter represents “work in progress.” It more than likely contains errors (hopefully not serious ones). Furthermore, its references to the literature are incomplete. Your comments and suggestions to the author at dimitrib@mit.edu are welcome. The date of last revision is given below.

作者查看全部 (1)

德

德梅萃·P. 博赛卡斯

Dynamic Programming and Optimal Control 3rd Edition, Volume II 论文

摘要

作者查看全部 (1)

相关技术查看全部 (1)

相关事件

相关文章