From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning (Paper)

2014 · Foundations and Trends® in Machine Learning · Cited by 247

Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Artificial Intelligence in Games

Author

Rémi Munos