Maximum causal entropy correlated equilibria for Markov games

  • Authors:
  • Brian D. Ziebart;J. Andrew Bagnell;Anind K. Dey

  • Affiliations:
  • Carnegie Mellon University;Carnegie Mellon University;Carnegie Mellon University

  • Venue:
  • The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

Motivated by a machine learning perspective---that game-theoretic equilibria constraints should serve as guidelines for predicting agents' strategies, we introduce maximum causal entropy correlated equilibria (MCECE), a novel solution concept for general-sum Markov games. In line with this perspective, a MCECE strategy profile is a uniquely-defined joint probability distribution over actions for each game state that minimizes the worst-case prediction of agents' actions under log-loss. Equivalently, it maximizes the worst-case growth rate for gambling on the sequences of agents' joint actions under uniform odds. We present a convex optimization technique for obtaining MCECE strategy profiles that resembles value iteration in finite-horizon games. We assess the predictive benefits of our approach by predicting the strategies generated by previously proposed correlated equilibria solution concepts, and compare against those previous approaches on that same prediction task.