Learning by demonstration in repeated stochastic games

  • Authors:
  • Jacob W. Crandall; Malek H. Altakrori; Yomna M. Hassan

  • Affiliations:
  • Masdar Institute of Science and Technology, Abu Dhabi, UAE (all authors)

  • Venue:
  • The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
  • Year:
  • 2011

Abstract

Despite much research in recent years, newly created multiagent learning (MAL) algorithms continue to have one or more fatal weaknesses. These weaknesses include slow learning rates, failure to learn non-myopic solutions, and inability to scale up to domains with many actions, states, and associates. To overcome these weaknesses, we argue that fundamentally different approaches to MAL should be developed. One possibility is to develop methods that allow people to teach learning agents. To begin to determine the usefulness of this approach, we explore the effectiveness of learning by demonstration (LbD) in repeated stochastic games.