The expected-outcome model of two-player games
Practical Issues in Temporal Difference Learning
Machine Learning
TD-Gammon, a self-teaching backgammon program, achieves master-level play
Neural Computation
A strategic metagame player for general chess-like games
AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
Learning to Predict by the Methods of Temporal Differences
Machine Learning
GENERAL GAME-PLAYING AND REINFORCEMENT LEARNING
Some studies in machine learning using the game of checkers
IBM Journal of Research and Development
The Metagame approach to computer game playing, introduced by Pell, involves writing programs that can play many games from some large class, rather than programs specialised to play just a single game such as chess. Metagame programs take the rules of a randomly generated game as input, analyse that game, and then play it against an opponent. Success in Metagame competitions is therefore evidence of a more general kind of ability than that possessed by (for example) a chess program or a draughts program. In this paper, we take up one of Pell's challenges by building a Metagame player that can learn. The learning techniques used are a refinement of the regression methods of Christensen and Korf, and they are applied to unsupervised learning, from self-play, of the weights of the components (or advisors) of the evaluation function. The method leads to significant improvement in playing strength for many (but not all) games in the class. We also shed light on some curious behaviour of certain advisor weights. In order to conduct this research, a new and more efficient Metagame player was written.
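The abstract describes learning the weights of an evaluation function whose components are advisors, using self-play. A minimal sketch of that idea is a linear evaluation (a weighted sum of advisor scores) whose weights are nudged toward a temporal-difference target; the advisor names, learning rate, and numbers below are illustrative assumptions, not details taken from the paper.

```python
# Sketch of a linear "advisor" evaluation with a TD-style weight update.
# All specifics (advisors, alpha, values) are hypothetical illustrations.

def evaluate(weights, features):
    """Linear evaluation: weighted sum of advisor scores for a position."""
    return sum(w * f for w, f in zip(weights, features))

def td_update(weights, features, value, target, alpha=0.01):
    """Move each weight to reduce the error between the current evaluation
    and a TD target (e.g. the evaluation of the position after self-play)."""
    error = target - value
    return [w + alpha * error * f for w, f in zip(weights, features)]

# Example: three hypothetical advisors (material, mobility, king safety).
weights = [1.0, 0.5, 0.2]
features = [2.0, 3.0, -1.0]            # advisor scores for one position
value = evaluate(weights, features)    # 1*2 + 0.5*3 + 0.2*(-1) = 3.3
weights = td_update(weights, features, value, target=4.0)
```

After the update each weight has moved in the direction of its advisor's score scaled by the prediction error, which is the standard gradient step for a linear evaluator.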