Effective short-term opponent exploitation in simplified poker

  • Authors:
  • Finnegan Southey; Bret Hoehn; Robert C. Holte

  • Affiliations:
  • Google, Mountain View, USA 94043; Dept. of Computing Science, University of Alberta, Edmonton, Canada T6G 2E8; Dept. of Computing Science, University of Alberta, Edmonton, Canada T6G 2E8

  • Venue:
  • Machine Learning
  • Year:
  • 2009

Abstract

Uncertainty in poker stems from two key sources: the shuffled deck and an adversary whose strategy is unknown. One approach to playing poker is to find a pessimistic game-theoretic solution (i.e., a Nash equilibrium), but human players have idiosyncratic weaknesses that can be exploited if some model or counter-strategy can be learned by observing their play. However, games against humans last for at most a few hundred hands, so learning must be very fast to be useful. We explore two approaches to opponent modelling in the context of Kuhn poker, a small game for which game-theoretic solutions are known. Parameter estimation and expert algorithms are both studied. Experiments demonstrate that, even in this small game, convergence to maximally exploitive solutions in a small number of hands is impractical, but that good (e.g., better than Nash) performance can be achieved in as few as 50 hands. Finally, we show that amongst a set of strategies with equal game-theoretic value, in particular the set of Nash equilibrium strategies, some are preferable because they speed learning of the opponent's strategy by exploring it more effectively.
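To make the parameter-estimation idea concrete, here is a minimal sketch (not the paper's exact model; the function name, the Beta prior, and the focus on a single "bluff with the Jack" probability are illustrative assumptions). In Kuhn poker a player's strategy reduces to a handful of action probabilities, so a modeller can estimate one such probability from observed hands with a simple Beta-Bernoulli posterior:

```python
import random

def estimate_bluff_rate(observed_bets, prior_a=1.0, prior_b=1.0):
    """Posterior mean of an opponent's bluff probability under a
    Beta(prior_a, prior_b) prior. observed_bets is a list of 0/1
    outcomes: 1 = the opponent bet while holding the Jack."""
    bets = sum(observed_bets)
    a = prior_a + bets
    b = prior_b + len(observed_bets) - bets
    return a / (a + b)

# Simulate 50 hands against an opponent who bluffs with probability 0.3
# (a hypothetical parameter, often written alpha in Kuhn poker analyses).
random.seed(0)
true_alpha = 0.3
hands = [1 if random.random() < true_alpha else 0 for _ in range(50)]
print(round(estimate_bluff_rate(hands), 3))
```

With only tens of observations the posterior mean is still noisy, which illustrates the paper's point: exact convergence to the maximally exploitive response is impractical in short matches, but even a rough estimate can beat a fixed equilibrium strategy.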