Honte, a go-playing program using neural nets

  • Authors:
  • Fredrik A. Dahl

  • Affiliations:
  • Norwegian Defence Research Establishment Division for Systems Analysis, P.O. Box 25, NO-2027 Kjeller, Norway

  • Venue:
  • Machines that learn to play games
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Go-playing program HONTE is described. It uses neural nets together with more conventional AI-methods like alpha-beta search. A neural net is trained by supervised learning to imitate local shapes seen in a database of expert games. A second net is trained to estimate the safety of groups by self play using a modified version of TD(λ)-learning. A third net is trained to estimate territorial potential of unoccupied points, also based on self play and TD(λ)- learning. Although the program has not yet reached the level of the most popular commercial Go-programs, results are encouraging.