Handbook of Markov Decision Processes (Paper)
2002; International series in management science/operations research / International series in operations research & management science; Citations: 300
Reinforcement Learning in Robotics; Advanced Queuing Theory Analysis; Stability and Control of Uncertain Systems