Learning cost-efficient control policies with XCSF: generalization capabilities and further improvement

  • Authors:
  • Didier Marin;Jérémie Decock;Lionel Rigoux;Olivier Sigaud

  • Affiliations:
  • Institut des Systèmes Intelligents et de Robotique, Paris, France;Institut des Systèmes Intelligents et de Robotique, Paris, France;Institut des Systèmes Intelligents et de Robotique, Paris, France;Institut des Systèmes Intelligents et de Robotique, Paris, France

  • Venue:
  • Proceedings of the 13th annual conference on Genetic and evolutionary computation
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we present a method based on the "learning from demonstration" paradigm to get a cost-efficient control policy in a continuous state and action space. The controlled plant is a two degrees-of-freedom planar arm actuated by six muscles. We learn a parametric control policy with XCSF from a few near-optimal trajectories, and we study its capability to generalize over the whole reachable space. Furthermore, we show that an additional Cross-Entropy Policy Search method can improve the global performance of the parametric controller.