A Reinforcement Learning Method for Maximizing Undiscounted Rewards 论文

1993Elsevier eBooks引用 318
Reinforcement Learning in RoboticsSupply Chain and Inventory ManagementAuction Theory and Applications