An analysis of model-based Interval Estimation for Markov Decision Processes · Paper

2008 · Journal of Computer and System Sciences · Cited by 447
Machine Learning and Algorithms · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research

An analysis of model-based Interval Estimation for Markov Decision Processes · Authors