Optimistic agents are asymptotically optimal

  • Authors:
  • Peter Sunehag;Marcus Hutter

  • Affiliations:
  • Research School of Computer Science, Australian National University, Canberra, Australia;Research School of Computer Science, Australian National University, Canberra, Australia

  • Venue:
  • AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.