Solving two-person zero-sum repeated games of incomplete information

Authors:
Andrew Gilpin;Tuomas Sandholm
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Year:
2008

Citing 7
Cited 1

Repeated games, duality and the central limit theorem

Mathematics of Operations Research
Primal-dual interior-point methods

Primal-dual interior-point methods
"Cav U" and the Dual Game

Mathematics of Operations Research
A near-optimal polynomial time algorithm for learning in certain classes of stochastic games

Artificial Intelligence
A polynomial-time nash equilibrium algorithm for repeated games

Proceedings of the 4th ACM conference on Electronic commerce
Nash q-learning for general-sum stochastic games

The Journal of Machine Learning Research
Perspectives on multiagent learning

Artificial Intelligence

Hustling in repeated zero-sum games with imperfect execution

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One

Quantified Score

Hi-index	0.00

Visualization

Abstract

In repeated games with incomplete information, rational agents must carefully weigh the tradeoffs of advantageously exploiting their information to achieve a short-term gain versus carefully concealing their information so as not to give up a long-term informed advantage. The theory of infinitely-repeated two-player zero-sum games with incomplete information has been carefully studied, beginning with the seminal work of Aumann and Maschler. While this theoretical work has produced a characterization of optimal strategies, algorithms for solving for optimal strategies have not yet been studied. For the case where one player is informed about the true state of the world and the other player is uninformed, we provide a non-convex mathematical programming formulation for computing the value of the game, as well as optimal strategies for the informed player. We then describe an efficient algorithm for solving this difficult optimization problem to within arbitrary accuracy. We also discuss how to efficiently compute optimal strategies for the uninformed player using the output of our algorithm.