Actor-Critic--Type Learning Algorithms for Markov Decision Processes 论文

1999SIAM Journal on Control and Optimization引用 238
Reinforcement Learning in RoboticsAdaptive Dynamic Programming ControlAdvanced Control Systems Optimization

Actor-Critic--Type Learning Algorithms for Markov Decision Processes · 相关技术