Speeding up gradient-based algorithms for sequential games

Authors:
Andrew Gilpin;Tuomas Sandholm
Affiliations:
Hg Analytics, LLC, New York, NY;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Year:
2010

Citing 5
Cited 1

Excessive Gap Technique in Nonsmooth Convex Minimization

SIAM Journal on Optimization
Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold'em poker

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
A new algorithm for generating equilibria in massive zero-sum games

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
First-order algorithm with O(ln(1/ε )) convergence for ε -equilibrium in two-person zero-sum games

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Gradient-based algorithms for finding Nash equilibria in extensive form games

WINE'07 Proceedings of the 3rd international conference on Internet and network economics

Computer poker: A review

Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

First-order (i.e., gradient-based) methods for solving two-person zero-sum sequential games of imperfect information have recently become important tools in the construction of game theory-based agents. The computation time per iteration is typically dominated by matrix-vector product operations involving the payoff matrix A. In this paper, we describe two techniques for scaling up this operation. The first is a randomized sampling technique that approximates A with a sparser matrix Ã. Then an approximate equilibrium for the original game is found by finding an approximate equilibrium of the sampled game. The second technique involves the development of an algorithm and system for performing the matrix-vector product on a cache-coherent Non-Uniform Memory Access (ccNUMA) architecture. The two techniques can be applied together or separately, and they each lead to an algorithm that significantly outperforms the fastest prior gradient-based method.