Learning to Predict Variable-Delay Rewards and Its Role in Autonomous Developmental Robotics

Authors:
Andrés Pérez-Uribe;Michele Courant
Affiliations:
-;-
Venue:
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Year:
2001

Citing 5
Cited 0

What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?

Neural Networks - Special issue on organisation of computation in brain-like systems
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Mobile Robot Miniaturisation: A Tool for Investigation in Control Algorithms

The 3rd International Symposium on Experimental Robotics III
Using a Time-Delay Actor-Critic Neural Architecture with Dopamine-Like Reinforcement Signal for Learning in Autonomous Robots

Emergent Neural Computational Architectures Based on Neuroscience - Towards Neuroscience-Inspired Computing
A phylogenetic, ontogenetic, and epigenetic view of bio-inspired hardware systems

IEEE Transactions on Evolutionary Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Researchers in the new field of "developmental robotics" propose to provide robots with so-called developmental programs. Similar to the development of human infants, robots might use those programs to interact with humans and their environment for extended periods of time, and become smarter autonomously, In this paper we show how a neural network model developed by neuroscientists can be used by an autonomous robot to learn by trial-and-error when considering rewards delivered at arbitrary times, as would be the case of developmental robots interacting with humans in the real world.