A three-network architecture for on-line learning and optimization based on adaptive dynamic programming 论文

2011Neurocomputing引用 221
Adaptive Dynamic Programming ControlReinforcement Learning in RoboticsMechanical Circulatory Support Devices