Game theory-based opponent modeling in large imperfect-information games

Authors:
Sam Ganzfried;Tuomas Sandholm
Affiliations:
Carnegie Mellon University;Carnegie Mellon University
Venue:
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Year:
2011

Citing 7
Cited 2

Perspectives on multiagent learning

Artificial Intelligence
A competitive Texas Hold'em poker player via automated abstraction and real-time equilibrium computation

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Effective short-term opponent exploitation in simplified poker

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold'em poker

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Bayes-relational learning of opponent models from incomplete information in no-limit poker

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Approximating game-theoretic optimal strategies for full-scale poker

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Gradient-based algorithms for finding Nash equilibria in extensive form games

WINE'07 Proceedings of the 3rd international conference on Internet and network economics

Safe opponent exploitation

Proceedings of the 13th ACM Conference on Electronic Commerce
Online implicit agent modelling

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop an algorithm for opponent modeling in large extensive-form games of imperfect information. It works by observing the opponent's action frequencies and building an opponent model by combining information from a precomputed equilibrium strategy with the observations. It then computes and plays a best response to this opponent model; the opponent model and best response are both updated continually in real time. The approach combines game-theoretic reasoning and pure opponent modeling, yielding a hybrid that can effectively exploit opponents after only a small number of interactions. Unlike prior opponent modeling approaches, ours is fundamentally game theoretic and takes advantage of recent algorithms for automated abstraction and equilibrium computation rather than relying on domain-specific prior distributions, historical data, or a handcrafted set of features. Experiments show that our algorithm leads to significantly higher win rates (than an approximate-equilibrium strategy) against several opponents in limit Texas Hold'em --- the most studied imperfect-information game in computer science --- including competitors from recent AAAI computer poker competitions.