Node-expansion operators for the UCT algorithm

Authors:
Takayuki Yajima;Tsuyoshi Hashimoto;Toshiki Matsui;Junichi Hashimoto;Kristian Spoerer
Affiliations:
Japan Advanced Institute of Science and Technology;Matsue College of Technology;KDDI Research Laboratory;Japan Advanced Institute of Science and Technology;Japan Advanced Institute of Science and Technology
Venue:
CG'10 Proceedings of the 7th international conference on Computers and games
Year:
2010

Citing 9
Cited 0

Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Amazons Discover Monte-Carlo

CG '08 Proceedings of the 6th international conference on Computers and Games
Monte-Carlo Tree Search Solver

CG '08 Proceedings of the 6th international conference on Computers and Games
Achieving master level play in 9×9 computer go

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Monte Carlo tree search techniques in the game of Kriegspiel

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning
Monte-Carlo tree search in settlers of catan

ACG'09 Proceedings of the 12th international conference on Advances in Computer Games
Move-Pruning techniques for monte-carlo go

ACG'05 Proceedings of the 11th international conference on Advances in Computer Games

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent works on the MCTS and UCT framework in the domain of Go focused on introducing knowledge to the playout and on pruning variations from the tree, but so far node expansion has not been investigated. In this paper we show that delaying expansion according to the number of the siblings delivers a gain of more than 92% when compared to normal expansion. We propose three improvements; one that uses domain knowledge and two that are domain-independent methods. Experimental results show that all advanced operators significantly improve the UCT performance when compared to the basic delaying expansion. From the results we may conclude that the new expansion operators are an appropriate means to improve the UCT algorithm.