Model-Free reinforcement learning with continuous action in practice 论文

2012引用 231

Reinforcement Learning in RoboticsViral Infectious Diseases and Gene Expression in InsectsRobot Manipulation and Learning

机器人 Reinforcement Learning in Robotics Robot Manipulation and Learning Viral Infectious Diseases and Gene Expression in Insects

关系图谱

作者

摘要

Reinforcement learning methods are often considered as a potential solution to enable a robot to adapt to changes in real time to an unpredictable environment. However, with continuous action, only a few existing algorithms are practical for real-time learning. In such a setting, most effective methods have used a parameterized policy structure, often with a separate parameterized value function. The goal of this paper is to assess such actor-critic methods to form a fully specified practical algorithm. Our specific contributions include 1) developing the extension of existing incremental policy-gradient algorithms to use eligibility traces, 2) an empirical comparison of the resulting algorithms using continuous actions, 3) the evaluation of a gradient-scaling technique that can significantly improve performance. Finally, we apply our actor-critic algorithm to learn on a robotic platform with a fast sensorimotor cycle (10ms). Overall, these results constitute an important step towards practical real-time learning control with continuous action.

作者查看全部 (3)

理

理查德·S·薩頓

Patrick M. Pilarski

Thomas Degris

Model-Free reinforcement learning with continuous action in practice 论文

摘要

作者查看全部 (3)

相关技术查看全部 (2)

相关事件

相关文章