Monte-Carlo tree search and rapid action value estimation in computer Go

Authors:
Sylvain Gelly;David Silver
Affiliations:
Université Paris Sud, LRI, CNRS, INRIA, France;University College London, UK
Venue:
Artificial Intelligence
Year:
2011

Citing 26
Cited 5

The History Heuristic and Alpha-Beta Search Enhancements in Practice

IEEE Transactions on Pattern Analysis and Machine Intelligence
Expected-Outcome: A General Model of Static Evaluation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Using probabilistic knowledge and simulation to play poker

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Computer Go: an AI oriented survey

Artificial Intelligence
Computer Go

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
World-championship-caliber Scrabble

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Honte, a go-playing program using neural nets

Machines that learn to play games
Combining online and offline knowledge in UCT

Proceedings of the 24th international conference on Machine learning
Sample-based learning and search with permanent and transient memories

Proceedings of the 25th international conference on Machine learning
Amazons Discover Monte-Carlo

CG '08 Proceedings of the 6th international conference on Computers and Games
An Analysis of UCT in Multi-player Games

CG '08 Proceedings of the 6th international conference on Computers and Games
Monte-Carlo simulation balancing

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Simulation-based approach to general game playing

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Achieving master level play in 9×9 computer go

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Reinforcement learning of local shape in the game of go

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
UCT for tactical assault planning in real-time strategy games

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Introduction of a new paraphrase generation tool based on Monte-Carlo sampling

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Associating domain-dependent knowledge and Monte Carlo approaches within a Go program

Information Sciences: an International Journal
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
Reinforcement learning and simulation-based search in computer go

Reinforcement learning and simulation-based search in computer go
Monte-Carlo simulation balancing in practice

CG'10 Proceedings of the 7th international conference on Computers and games
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning
Evaluation function based monte-carlo LOA

ACG'09 Proceedings of the 12th international conference on Advances in Computer Games
Creating an upper-confidence-tree program for havannah

ACG'09 Proceedings of the 12th international conference on Advances in Computer Games
Move-Pruning techniques for monte-carlo go

ACG'05 Proceedings of the 11th international conference on Advances in Computer Games

The grand challenge of computer Go: Monte Carlo tree search and extensions

Communications of the ACM
Monte-Carlo tree search for the physical travelling salesman problem

EvoApplications'12 Proceedings of the 2012t European conference on Applications of Evolutionary Computation
Bitboard knowledge base system and elegant search architectures for Connect6

Knowledge-Based Systems
Procedural knowledge for integrated modelling: Towards the Modelling Playground

Environmental Modelling & Software
A survey of multi-objective sequential decision-making

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.02

Visualization

Abstract

A new paradigm for search, based on Monte-Carlo simulation, has revolutionised the performance of computer Go programs. In this article we describe two extensions to the Monte-Carlo tree search algorithm, which significantly improve the effectiveness of the basic algorithm. When we applied these two extensions to the Go program MoGo, it became the first program to achieve dan (master) level in 9x9 Go. In this article we survey the Monte-Carlo revolution in computer Go, outline the key ideas that led to the success of MoGo and subsequent Go programs, and provide for the first time a comprehensive description, in theory and in practice, of this extended framework for Monte-Carlo tree search.