Coevolutionary temporal difference learning for Othello

Authors:
Marcin Szubert;Wojciech Jaskowski;Krzysztof Krawiec
Affiliations:
Institute of Computing Science, Poznan University of Technology, Poznan, Poland (all three authors)
Venue:
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Year:
2009

Abstract

This paper presents Coevolutionary Temporal Difference Learning (CTDL), a novel way of hybridizing coevolutionary search with reinforcement learning by interlacing one-population competitive coevolution with temporal difference learning. The coevolutionary part of the algorithm explores the solution space, while temporal difference learning exploits it through local search. We apply CTDL to the board game of Othello, using a weighted piece counter to represent players' strategies. The results of an extensive computational experiment demonstrate CTDL's superiority over coevolution and reinforcement learning alone, particularly when coevolution maintains an archive to sustain historical progress. The paper investigates the relative intensity of coevolutionary search and temporal difference search, which turns out to be an essential parameter. The formulation of CTDL also leads to the introduction of a Lamarckian form of coevolution, which we discuss in detail.
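The interlacing described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the board encoding (+1 own piece, -1 opponent piece, 0 empty), the tanh squashing of the weighted piece counter, the TD(0) update rule, or the learning rate `alpha`, so all of these are assumptions. The names `wpc_value`, `td0_update`, `ctdl`, `td_phase`, and `coevolve` are hypothetical.

```python
import math

def wpc_value(weights, board):
    """Weighted piece counter: one weight per board cell.

    Assumed encoding: board[i] is +1 (own piece), -1 (opponent), 0 (empty).
    The tanh squashing to (-1, 1) is an assumption, not stated in the abstract.
    """
    s = sum(w * b for w, b in zip(weights, board))
    return math.tanh(s)

def td0_update(weights, board, target, alpha=0.01):
    """One TD(0)-style gradient step moving wpc_value(board) toward target.

    target would be the value of the successor state, or the game outcome
    at a terminal state. Returns a new weight list; the input is unchanged.
    """
    v = wpc_value(weights, board)
    delta = target - v          # temporal difference error
    slope = 1.0 - v * v         # derivative of tanh at the current value
    return [w + alpha * delta * slope * b for w, b in zip(weights, board)]

def ctdl(population, generations, td_phase, coevolve):
    """Interlace TD learning (local search) with coevolution (exploration).

    td_phase: hypothetical callable that improves one weight vector by
              self-play TD learning and returns the updated weights.
    coevolve: hypothetical callable that evaluates individuals against each
              other, selects, and mutates, returning the next population.
    Writing the learned weights back into the individuals before selection
    is what makes this sketch Lamarckian.
    """
    for _ in range(generations):
        population = [td_phase(w) for w in population]
        population = coevolve(population)
    return population
```

In this sketch, the relative intensity of the two search modes would be set inside `td_phase`, for example by the number of self-play games played per generation.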