Approximate policy iteration for closed-loop learning of visual tasks

Authors:
Sébastien Jodogne;Cyril Briquet;Justus H. Piater
Affiliations:
Montefiore Institute (B28), University of Liège, Liège, Belgium;Montefiore Institute (B28), University of Liège, Liège, Belgium;Montefiore Institute (B28), University of Liège, Liège, Belgium
Venue:
ECML'06 Proceedings of the 17th European conference on Machine Learning
Year:
2006

Citing 9
Cited 2

Evaluation of Interest Point Detectors

International Journal of Computer Vision - Special issue on a special section on visual surveillance
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Least-squares policy iteration

The Journal of Machine Learning Research
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Random Subwindows for Robust Image Classification

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Tree-Based Batch Mode Reinforcement Learning

The Journal of Machine Learning Research
Task-Driven Learning of Spatial Combinations of Visual Features

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Interactive learning of mappings from visual percepts to actions

ICML '05 Proceedings of the 22nd international conference on Machine learning
Extremely randomized trees

Machine Learning

Scheduling data-intensive bags of tasks in P2P grids with bittorrent-enabled data distribution

Proceedings of the second workshop on Use of P2P, GRID and agents for the development of content networks
P2P file sharing for P2P computing

Multiagent and Grid Systems - Content management and delivery through P2P-based content networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for the closed-loop learning of mappings from images to actions. This approach requires a family of function approximators that maps visual percepts to a real-valued function. For this purpose, we use Regression Extra-Trees, a fast, yet accurate and versatile machine learning algorithm. The inputs of the Extra-Trees consist of a set of visual features that digest the informative patterns in the visual signal. We also show how to parallelize the Extra-Tree learning process to further reduce the computational expense, which is often essential in visual tasks. Experimental results on real-world images are given that indicate that the combination of API with Extra-Trees is a promising framework for the interactive learning of visual tasks.