Optimal efficient learning equilibrium: imperfect monitoring in symmetric games

Authors:
Ronen I. Brafman; Moshe Tennenholtz
Affiliations:
Dept. of Computer Science, Ben-Gurion University, Beer-Sheva, Israel and Department of Computer Science, Stanford University, Stanford, CA; Faculty of Industrial Eng. & Management, Technion, Haifa, Israel
Venue:
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Year:
2005

Abstract

Efficient Learning Equilibrium (ELE) is a natural solution concept for multi-agent encounters with incomplete information. It requires the learning algorithms themselves to be in equilibrium for any game selected from a set of (initially unknown) games. In an optimal ELE, the learning algorithms efficiently obtain the surplus the agents would obtain in an optimal Nash equilibrium of the initially unknown game being played. Crucially, in an ELE, deviations from the learning algorithms become non-beneficial after polynomial time, even though the game played is initially unknown. While the concept is appealing, the main challenge in establishing learning algorithms based on it is to isolate general classes of games in which an ELE exists. Unfortunately, it has been shown that while an ELE exists when each agent can observe all other agents' actions and payoffs, an ELE does not exist in general when the other agents' payoffs cannot be observed. In this paper we provide the first positive results on this problem, constructively proving the existence of an optimal ELE for the class of symmetric games in which an agent cannot observe other agents' payoffs.
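
To make two terms from the abstract concrete, the sketch below checks whether a two-player game is symmetric and picks the pure-strategy Nash equilibrium with the highest total payoff. It is only an illustration under stated assumptions, not the construction from the paper. It assumes the game is fully known, has two players, and is given as square payoff matrices. It considers pure strategies only, and it reads "optimal" as "maximizing the sum of the players' payoffs". All function names and the example payoffs are hypothetical.

```python
from itertools import product


def is_symmetric(row_payoffs, col_payoffs):
    """A two-player game is symmetric if u2(a, b) == u1(b, a) for all actions."""
    n = len(row_payoffs)
    return all(
        col_payoffs[a][b] == row_payoffs[b][a]
        for a, b in product(range(n), repeat=2)
    )


def pure_nash_equilibria(row_payoffs, col_payoffs):
    """Return all pure action profiles (a, b) where neither player gains by deviating."""
    n = len(row_payoffs)
    equilibria = []
    for a, b in product(range(n), repeat=2):
        # Row player cannot improve by switching rows while the column stays b.
        row_best = all(row_payoffs[a][b] >= row_payoffs[x][b] for x in range(n))
        # Column player cannot improve by switching columns while the row stays a.
        col_best = all(col_payoffs[a][b] >= col_payoffs[a][y] for y in range(n))
        if row_best and col_best:
            equilibria.append((a, b))
    return equilibria


def max_surplus_equilibrium(row_payoffs, col_payoffs):
    """Assumption: 'optimal' means the pure equilibrium with the largest payoff sum."""
    equilibria = pure_nash_equilibria(row_payoffs, col_payoffs)
    if not equilibria:
        return None  # no pure-strategy equilibrium exists
    return max(
        equilibria,
        key=lambda ab: row_payoffs[ab[0]][ab[1]] + col_payoffs[ab[0]][ab[1]],
    )


# Hypothetical 2x2 coordination game; the column player's matrix is the transpose
# of the row player's matrix, which is what symmetry requires.
ROW = [[2, 0], [0, 1]]
COL = [[2, 0], [0, 1]]

if __name__ == "__main__":
    print(is_symmetric(ROW, COL))
    print(pure_nash_equilibria(ROW, COL))
    print(max_surplus_equilibrium(ROW, COL))
```

In the setting the abstract describes, the agents do not know the game in advance and cannot observe each other's payoffs, so a computation like this is not directly available to them. The sketch only shows what the target outcome looks like once the game is known.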