An agent that deviates from a usual or previous course of action can be said to display novel or varying behaviour. Novelty of behaviour can be seen as the result of real or apparent randomness in decision making, which prevents an agent from exactly repeating past choices. In this paper, novelty of behaviour is considered an evolutionary precursor of the exploration skill in reward learning, and conservative behaviour a precursor of exploitation. Novelty of behaviour in neural control is hypothesised to be an important factor in the neuro-evolution of operant reward learning. Agents capable of varying behaviour when exposed to reward stimuli, as opposed to conservative agents, appear to acquire the meaning and use of such reward information on a faster evolutionary timescale. The hypothesis is validated by comparing performance during evolution in two environments that either favour or are neutral to novelty. Following these findings, we suggest that neuro-evolution of operant reward learning is fostered by environments where behavioural novelty is intrinsically beneficial, i.e. where varying or exploring behaviour is associated with low risk.