Markovian Decision Processes with Uncertain Transition Probabilities 论文

1973Operations Research引用 227
Reinforcement Learning in RoboticsAdvanced Bandit Algorithms ResearchOptimization and Search Problems

Markovian Decision Processes with Uncertain Transition Probabilities · 作者