Efficient selectivity and backup operators in Monte-Carlo tree search

Authors:
Rémi Coulom
Affiliations:
CNRS-LIFL, INRIA-SequeL, Université Charles de Gaulle, Lille, France
Venue:
CG'06 Proceedings of the 5th international conference on Computers and games
Year:
2006

Abstract

A Monte-Carlo evaluation consists of estimating a position by averaging the outcomes of several random continuations. The method can serve as an evaluation function at the leaves of a min-max tree. This paper presents a new framework for combining tree search with Monte-Carlo evaluation that does not separate a min-max phase from a Monte-Carlo phase. Instead of backing up the min-max value close to the root and the average value at some depth, a more general backup operator is defined that progressively changes from averaging to min-max as the number of simulations grows. This approach provides fine-grained control of the tree growth, at the level of individual simulations, and allows efficient selectivity. The resulting algorithm was implemented in a 9 × 9 Go-playing program, Crazy Stone, that won the 10th KGS computer-Go tournament.
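
The two ideas in the abstract can be illustrated with a small Python sketch. It is not the operator defined in the paper: the abstract gives no formula, so the interpolation weight `n / (n + k)`, the constant `k`, and the function names `monte_carlo_evaluate` and `blended_backup` are all assumptions made for illustration. Values are taken from the point of view of the player to move at the node, so "min-max" reduces to a maximum over the children.

```python
import random
from statistics import mean


def monte_carlo_evaluate(playout, n_simulations, rng):
    """Estimate a position by averaging the outcomes of random continuations.

    `playout` is a hypothetical callable that plays one random continuation
    using `rng` and returns its outcome as a number.
    """
    return mean(playout(rng) for _ in range(n_simulations))


def blended_backup(child_means, child_counts, k=16.0):
    """Back up a node value that shifts from averaging to max as counts grow.

    Assumed illustrative rule (not taken from the paper):
        weight = n / (n + k), where n is the total number of simulations
        value  = (1 - weight) * average + weight * best
    """
    total = sum(child_counts)
    if total == 0:
        raise ValueError("at least one simulation is required")
    # Simulation-weighted average of the children's mean outcomes.
    average = sum(m * c for m, c in zip(child_means, child_counts)) / total
    # Best mean outcome among the children that have been simulated.
    best = max(m for m, c in zip(child_means, child_counts) if c > 0)
    weight = total / (total + k)
    return (1.0 - weight) * average + weight * best


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy playout: a coin flip standing in for a random game result.
    estimate = monte_carlo_evaluate(lambda r: float(r.random() < 0.5), 1000, rng)
    print("Monte-Carlo estimate:", estimate)
    print("few simulations: ", blended_backup([0.2, 0.8], [1, 1]))
    print("many simulations:", blended_backup([0.2, 0.8], [1000, 1000]))
```

With few simulations the backed-up value stays close to the plain average of the children; with many it approaches the value of the best child, which is the qualitative behaviour the abstract describes.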