Accelerating autonomous learning by using heuristic selection of actions

  • Authors:
  • Reinaldo A. Bianchi;Carlos H. Ribeiro;Anna H. Costa

  • Affiliations:
  • Centro Universitário da FEI, São Bernardo do Campo, Brazil 09850-901;Instituto Tecnológico de Aeronáutica, São José dos Campos, Brazil 12228-900;Escola Politécnica da Universidade de São Paulo, São Paulo, Brazil 05508-900

  • Venue:
  • Journal of Heuristics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control policies using any RL algorithm can be very time consuming, we propose to combine RL algorithms with heuristic functions for selecting promising actions during the learning process. With this aim, we investigate the use of heuristics for increasing the rate of convergence of RL algorithms and contribute with a new learning algorithm, Heuristically Accelerated Q-learning (HAQL), which incorporates heuristics for action selection to the Q-Learning algorithm. Experimental results on robot navigation show that the use of even very simple heuristic functions results in significant performance enhancement of the learning rate.