First-order algorithm with O(ln(1/ε )) convergence for ε -equilibrium in two-person zero-sum games

Authors:
Andrew Gilpin;Javier Peña;Thomas Sandholm
Affiliations:
Computer Science Department, Carnegie Mellon University, Pittsburgh, PA;Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA;Computer Science Department, Carnegie Mellon University, Pittsburgh, PA
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Year:
2008

Citing 6
Cited 4

Primal-dual interior-point methods

Primal-dual interior-point methods
Smooth minimization of non-smooth functions

Mathematical Programming: Series A and B
Excessive Gap Technique in Nonsmooth Convex Minimization

SIAM Journal on Optimization
Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold'em poker

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
A new algorithm for generating equilibria in massive zero-sum games

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Gradient-based algorithms for finding Nash equilibria in extensive form games

WINE'07 Proceedings of the 3rd international conference on Internet and network economics

Expectation-based versus potential-aware automated abstraction in imperfect information games: an experimental comparison using poker

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Speeding up gradient-based algorithms for sequential games

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Applying Metric Regularity to Compute a Condition Measure of a Smoothing Algorithm for Matrix Games

SIAM Journal on Optimization
Near-optimal no-regret algorithms for zero-sum games

Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose an iterated version of Nesterov's first-order smoothing method for the two-person zero-sum game equilibrium problem minx∈Q1 maxy∈Q2 xT Ay = maxy∈Q2 minx∈Q1 xTAy. This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an Ε-equilibrium to this min-max problem in O(κ(A) In(1/Ε)) first-order iterations, where κ(A) is a certain condition measure of the matrix A. This improves upon the previous first-order methods which required O(1/Ε) iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm's dependence on Ε. Unlike the interior-point methods that are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov's algorithm with an outer loop that lowers the target Ε between iterations (this target affects the amount of smoothing in the inner loop). We find it surprising that such a simple modification yields an exponential speed improvement. Finally, computational experiments both in matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds).