Reinforcement Learning Control of a Flexible Two-Link Manipulator: An Experimental Investigation 论文

2020IEEE Transactions on Systems Man and Cybernetics Systems引用 273
Adaptive Dynamic Programming ControlAdaptive Control of Nonlinear SystemsReinforcement Learning in Robotics

摘要

This article discusses the control design and experiment validation of a flexible two-link manipulator (FTLM) system represented by ordinary differential equations (ODEs). A reinforcement learning (RL) control strategy is developed that is based on actor–critic structure to enable vibration suppression while retaining trajectory tracking. Subsequently, the closed-loop system with the proposed RL control algorithm is proved to be semi-global uniform ultimate bounded (SGUUB) by Lyapunov’s direct method. In the simulations, the control approach presented has been tested on the discretized ODE dynamic model and the analytical claims have been justified under the existence of uncertainty. Eventually, a series of experiments in a Quanser laboratory platform are investigated to demonstrate the effectiveness of the presented control and its application effect is compared with PD control.