Bandit Based Monte-Carlo Planning 论文

2006Lecture notes in computer science引用 2856
Advanced Bandit Algorithms ResearchReinforcement Learning in RoboticsMachine Learning and Algorithms