Approximate policy iteration for closed-loop learning of visual tasks

  • Authors:
  • Sébastien Jodogne;Cyril Briquet;Justus H. Piater

  • Affiliations:
  • Montefiore Institute (B28), University of Liège, Liège, Belgium;Montefiore Institute (B28), University of Liège, Liège, Belgium;Montefiore Institute (B28), University of Liège, Liège, Belgium

  • Venue:
  • ECML'06 Proceedings of the 17th European conference on Machine Learning
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for the closed-loop learning of mappings from images to actions. This approach requires a family of function approximators that maps visual percepts to a real-valued function. For this purpose, we use Regression Extra-Trees, a fast, yet accurate and versatile machine learning algorithm. The inputs of the Extra-Trees consist of a set of visual features that digest the informative patterns in the visual signal. We also show how to parallelize the Extra-Tree learning process to further reduce the computational expense, which is often essential in visual tasks. Experimental results on real-world images are given that indicate that the combination of API with Extra-Trees is a promising framework for the interactive learning of visual tasks.