Application of SONQL for real-time learning of robot behaviors

  • Authors:
  • Marc Carreras;Junku Yuh;Joan Batlle;Pere Ridao

  • Affiliations:
  • Institute of Informatics and Applications, University of Girona, Edifici PIV, Campus Montilivi, 17071 Girona, Spain;National Science Foundation, 4201 Wilson Blvd. Suit 1125, Arlington, VA 22230, USA;Institute of Informatics and Applications, University of Girona, Edifici PIV, Campus Montilivi, 17071 Girona, Spain;Institute of Informatics and Applications, University of Girona, Edifici PIV, Campus Montilivi, 17071 Girona, Spain

  • Venue:
  • Robotics and Autonomous Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the Semi-Online Neural-Q-learning (SONQL) algorithm designed for real-time learning of reactive robot behaviors. The Q-function is generalized by a multilayer neural network allowing the use of continuous states. The algorithm uses a database of the most recent learning samples to accelerate and improve the convergence. Each SONQL algorithm represents an independent, reactive and adaptive state-action mapping, which implements the function of a robot behavior for one degree of freedom (DOF). The generalization capability of the SONQL algorithm was demonstrated by computer simulation with the ''mountain-car'' benchmark. The SONQL was also investigated by experiment on a mobile robot for a target-following task. Experimental results show that the SONQL is promising for online robot learning.