Observing the evolution of neural networks learning to play the game of Othello

Authors:
S. Y. Chong;M. K. Tan;J. D. White
Affiliations:
Sch. of Comput. Sci., Univ. of Birmingham, UK;-;-
Venue:
IEEE Transactions on Evolutionary Computation
Year:
2005

Citing 0
Cited 7

Application of reinforcement learning to the game of Othello

Computers and Operations Research
Neural network Othello competition

ACM SIGEVOlution
Spatial processing layer effects on the evolution of neural networks to play the game of Othello

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Coevolution in a large search space using resource-limited nash memory

Proceedings of the 12th annual conference on Genetic and evolutionary computation
Learning n-tuple networks for othello by coevolutionary gradient search

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Improving coevolution by random sampling

Proceedings of the 15th annual conference on Genetic and evolutionary computation
Shaping fitness function for evolutionary learning of game strategies

Proceedings of the 15th annual conference on Genetic and evolutionary computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

A study was conducted to find out how game-playing strategies for Othello (also known as reversi) can be learned without expert knowledge. The approach used the coevolution of a fixed-architecture neural-network-based evaluation function combined with a standard minimax search algorithm. Comparisons between evolving neural networks and computer players that used deterministic strategies allowed evolution to be observed in real-time. Neural networks evolved to outperform the computer players playing at higher ply-depths, despite being handicapped by playing black and using minimax at ply-depth of two. In addition, the playing ability of the population progressed from novice, to intermediate, and then to master's level. Individual neural networks discovered various game-playing strategies, starting with positional and later mobility. These results show that neural networks can be evolved as evaluation functions, despite the general difficulties associated with this approach. Success in this case was due to a simple spatial preprocessing layer in the neural network that captured spatial information, self-adaptation of every weight and bias of the neural network, and a selection method that allowed a diverse population of neural networks to be carried forward from one generation to the next.