As Safe As It Gets: Near-Optimal Learning in Multi-Stage Games with Imperfect Monitoring

Authors:
Danny Kuminov; Moshe Tennenholtz
Affiliations:
Technion - Israel Institute of Technology, Haifa, Israel 32000. Email: dannykv@tx.technion.ac.il; Technion - Israel Institute of Technology, Haifa, Israel 32000. Email: moshet@ie.technion.ac.il
Venue:
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Year:
2008

Abstract

We introduce the first near-optimal polynomial algorithm for obtaining the mixed safety level value of an initially unknown multi-stage game, played in a hostile environment, under imperfect monitoring. In an imperfect monitoring setting, an agent can observe only the current state and its own actions and payoffs; it cannot observe other agents' actions. Our result holds for any generic multi-stage game with a “reset” action.
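
To make the term "mixed safety level value" concrete, the sketch below computes it for a small, fully known one-shot matrix game in which the agent has two actions. This is only an illustration of the quantity being learned, not the algorithm from the paper, which addresses the much harder case of an initially unknown multi-stage game. The function name `mixed_safety_level` and the example payoff matrices are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import combinations


def mixed_safety_level(payoffs):
    """Mixed safety level (maximin value) for an agent with exactly two actions.

    payoffs[i][j] is the agent's payoff when it plays action i and the
    environment plays action j. Returns (value, p), where p is the
    probability of action 0 that maximizes the worst-case expected payoff.
    """
    row0, row1 = payoffs
    if not row0 or len(row0) != len(row1):
        raise ValueError("both rows must be non-empty and of equal length")
    row0 = [Fraction(x) for x in row0]
    row1 = [Fraction(x) for x in row1]

    def worst_case(p):
        # Expected payoff against the most hostile environment action.
        return min(p * a + (1 - p) * b for a, b in zip(row0, row1))

    # The worst-case payoff is concave and piecewise linear in p, so its
    # maximum is attained at an endpoint or where two payoff lines cross.
    candidates = {Fraction(0), Fraction(1)}
    for (a1, b1), (a2, b2) in combinations(zip(row0, row1), 2):
        denom = (a1 - b1) - (a2 - b2)
        if denom != 0:
            p = (b2 - b1) / denom
            if 0 <= p <= 1:
                candidates.add(p)

    best_p = max(candidates, key=worst_case)
    return worst_case(best_p), best_p


# Matching pennies: any pure action guarantees only -1, while mixing
# the two actions equally guarantees an expected payoff of 0.
value, p = mixed_safety_level([[1, -1], [-1, 1]])
print(value, p)
```

In matching pennies the pure safety level is -1 but the mixed safety level is 0, which is why the mixed value is the more meaningful target in a hostile environment. Under imperfect monitoring the agent never sees the column that was played, so the payoff matrix used here would not be directly available to it.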