Modiﬁcation of UCT with Patterns in Monte-Carlo Go 论文

2006HAL (Le Centre pour la Communication Scientifique Directe)引用 294

Artificial Intelligence in GamesReinforcement Learning in RoboticsMetaheuristic Optimization Algorithms Research

机器人 Metaheuristic Optimization Algorithms Research Reinforcement Learning in Robotics Artificial Intelligence in Games

作者

摘要

Algorithm UCB1 for multi-armed bandit problem has already been extended to Algorithm UCT (Upper bound Confidence for Tree) which works for minimax tree search. We have developed a Monte-Carlo Go program, MoGo, which is the first computer Go program using UCT. We explain our modification of UCT for Go application and also the intelligent random simulation with patterns which has improved significantly the performance of MoGo. UCT combined with pruning techniques for large Go board is discussed, as well as parallelization of UCT. MoGo is now a top level Go program on $9\times9$ and $13\times13$ Go boards.

作者查看全部 (4)

Olivier Teytaud

Rémi Munos

Yizao Wang

Sylvain Gelly

Modiﬁcation of UCT with Patterns in Monte-Carlo Go 论文

摘要

作者查看全部 (4)

相关技术查看全部 (2)

相关事件

相关文章