From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning 论文
2014Foundations and Trends® in Machine Learning引用 247
Advanced Bandit Algorithms ResearchReinforcement Learning in RoboticsArtificial Intelligence in Games