The grand challenge of computer Go: Monte Carlo tree search and extensions

Authors:
Sylvain Gelly;Levente Kocsis;Marc Schoenauer;Michèle Sebag;David Silver;Csaba Szepesvári;Olivier Teytaud
Affiliations:
EPC TAO, INRIA Saclay & LRI, Orsay, France;MTA SZTAKI, Budapest, Hungary;EPC TAO, INRIA Saclay & LRI, Orsay, France;EPC TAO, INRIA Saclay & LRI, Orsay, France;University College London, London, U.K.;University of Alberta, Edmonton, Canada;EPC TAO, INRIA Saclay & LRI, Orsay, France
Venue:
Communications of the ACM
Year:
2012

Citing 12
Cited 2

Expected-Outcome: A General Model of Static Evaluation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Optimal unbiased estimators for evaluating agent performance

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Simulation-based approach to general game playing

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Monte-Carlo exploration for deterministic planning

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
A study on security evaluation methodology for image-based biometrics authentication systems

BTAS'09 Proceedings of the 3rd IEEE international conference on Biometrics: Theory, applications and systems
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
The true score of statistical paraphrase generation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Scalability and parallelization of Monte-Carlo tree search

CG'10 Proceedings of the 7th international conference on Computers and games
Monte-Carlo simulation balancing in practice

CG'10 Proceedings of the 7th international conference on Computers and games
Monte-Carlo tree search and rapid action value estimation in computer Go

Artificial Intelligence
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning

Design and parametric considerations for artificial neural network pruning in UCT game playing

Proceedings of the South African Institute for Computer Scientists and Information Technologists Conference
Scalable and efficient bayes-adaptive reinforcement learning based on monte-carlo tree search

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	48.22

Visualization

Abstract

The ancient oriental game of Go has long been considered a grand challenge for artificial intelligence. For decades, computer Go has defied the classical methods in game tree search that worked so successfully for chess and checkers. However, recent play in computer Go has been transformed by a new paradigm for tree search based on Monte-Carlo methods. Programs based on Monte-Carlo tree search now play at human-master levels and are beginning to challenge top professional players. In this paper, we describe the leading algorithms for Monte-Carlo tree search and explain how they have advanced the state of the art in computer Go.