An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs

  • Authors:
  • Thomas Gabel;Martin Riedmiller

  • Affiliations:
  • Neuroinformatics Group, Department of Mathematics and Computer Science, Institute of Cognitive Science, University of Osnabrück, 49069 Osnabrück, Germany;Neuroinformatics Group, Department of Mathematics and Computer Science, Institute of Cognitive Science, University of Osnabrück, 49069 Osnabrück, Germany

  • Venue:
  • ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We identify two fundamental points of utilizing CBR for an adaptive agent that tries to learn on the basis of trial and error without a model of its environment. The first link concerns the utmost efficient exploitation of experience the agent has collected by interacting within its environment, while the second relates to the acquisition and representation of a suitable behavior policy. Combining both connections, we develop a state-action value function approximation mechanism that relies on case-based, approximate transition graphs and forms the basis on which the agent improves its behavior. We evaluate our approach empirically in the context of dynamic control tasks.