Dynamic randomization and domain knowledge in Monte-Carlo Tree Search for Go knowledge-based systems

  • Authors:
  • Keh-Hsun Chen

  • Affiliations:
  • Department of Computer Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA

  • Venue:
  • Knowledge-Based Systems
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper is an extension of the article [13] presented at IWCG of TAAI 2010. It proposes two dynamic randomization techniques for Monte-Carlo Tree Search (MCTS) in Go. First, during the in-tree phase of a simulation game, the parameters are randomized in selected ranges before each simulation move. Second, during the play-out phase, the priority orders of the simulation move-generators are hierarchically randomized before each play-out move. Essential domain knowledge used in MCTS for Go is discussed. Both dynamic randomization techniques increase diversity while keeping the sanity of the simulation games. Experimental testing has been completely re-conducted more extensively with the latest version of GoIntellect (GI) on all three Go categories of 19x19, 13x13, and 9x9 boards. The results show that dynamic randomization increases the playing strength of GI significantly with 128K simulations per move, the improvement is about seven percentage points in the winning rate against GnuGo on 19x19 Go over the version of GI without dynamic randomization, about three percentage points on 13x13 Go, and four percentage points on 9x9 Go.