In this article, we present an isotropic unsupervised algorithm for temporal sequence learning. No special reward signal is used, so all inputs are completely isotropic. All input signals are bandpass filtered before converging onto a linear output neuron, and all synaptic weights change according to the correlation of the bandpass-filtered inputs with the derivative of the output. We investigate the algorithm in an open- and a closed-loop condition, the latter defined by embedding the learning system in a behavioral feedback loop. In the open-loop condition, we find that the linear structure of the algorithm allows the shape of the weight change to be calculated analytically; it is strictly heterosynaptic and follows the weight-change curves found in spike-timing-dependent plasticity. Furthermore, we show that synaptic weights stabilize automatically, without additional normalizing measures, as soon as no temporal differences remain between the inputs. In the second part of this study, the algorithm is placed in an environment that leads to a closed sensorimotor loop. To this end, a robot is programmed with a prewired retraction reflex in response to collisions. Through isotropic sequence order (ISO) learning, the robot achieves collision avoidance by learning the correlation between its early range-finder signals and the later-occurring collision signal. Synaptic weights stabilize at the end of learning, as theoretically predicted. Finally, we discuss the relation of ISO learning to other drive-reinforcement models and to the commonly used temporal difference learning algorithm. This study is followed up by a mathematical analysis of the closed-loop situation in the companion article in this issue, "ISO Learning Approximates a Solution to the Inverse-Controller Problem in an Unsupervised Behavioral Paradigm" (pp. 865-884).
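The open-loop rule described above can be illustrated with a minimal numerical sketch: every input is convolved with a bandpass impulse response, the filtered traces converge on a linear output unit, and each weight is updated in proportion to the correlation of its filtered input with the temporal derivative of the output. This is only a schematic reconstruction from the abstract, not the article's implementation; the damped-sine impulse response, the Euler discretization, the parameter values, and the two-input setup (a later "reflex" pulse preceded by an earlier predictive pulse) are all illustrative assumptions.

```python
import numpy as np

def bandpass_impulse(t, a=0.05, f=0.02):
    """Illustrative bandpass impulse response: a damped sine wave.
    (Assumed filter shape; the article's exact filters may differ.)"""
    return np.exp(-a * t) * np.sin(2.0 * np.pi * f * t)

def iso_learning(x, mu=0.01, w_init=None, kernel_len=200):
    """Sketch of the isotropic sequence-order learning rule.

    x      : (T, n) array of raw input signals.
    mu     : learning rate (illustrative value).
    w_init : initial weights; defaults to reflex weight 1, others 0
             (an assumption for this demo -- all weights remain plastic).
    Returns the final weight vector after one pass through the data.
    """
    T, n = x.shape
    h = bandpass_impulse(np.arange(kernel_len))
    # Bandpass-filter every input channel before the linear output neuron.
    u = np.stack([np.convolve(x[:, i], h)[:T] for i in range(n)], axis=1)

    w = np.array([1.0] + [0.0] * (n - 1)) if w_init is None else w_init.copy()
    v_prev = 0.0
    for step in range(T):
        v = u[step] @ w                 # linear output neuron
        dv = v - v_prev                 # discrete derivative of the output
        w = w + mu * u[step] * dv       # isotropic rule: every weight changes
        v_prev = v
    return w

# Demo: a predictive pulse on channel 1 precedes a "reflex" pulse on channel 0.
T = 500
x = np.zeros((T, 2))
x[80, 1] = 1.0    # earlier, predictive input
x[100, 0] = 1.0   # later input (the reflex-triggering signal)
w = iso_learning(x)
```

Because the rule correlates a filtered input trace with the output derivative, the weight of the earlier channel changes only while both traces overlap in time; once the inputs carry no temporal difference, the update term vanishes, which is the self-stabilization property the abstract refers to.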