A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures

Authors:
Edmund T. Rolls;T. T. Milward
Affiliations:
Oxford University, Department of Experimental Psychology, Oxford OX1 3UD, England;Oxford University, Department of Experimental Psychology, Oxford OX1 3UD, England
Venue:
Neural Computation
Year:
2000

Citing 8
Cited 14

Learning invariance from transformation sequences

Neural Computation
Optimal, unsupervised learning in invariant object recognition

Neural Computation
A neurophysiological and computational approach to the functions of the temporal lobe cortical visual areas in invariant object recognition

Computational and psychophysical mechanisms of visual coding
Transform-invariant recognition by association in a recurrent network

Neural Computation
Firing rate distributions and efficiency of information transmission of inferior temporal cortex neurons to natural visual stimuli

Neural Computation
On decoding the responses of a population of neurons from short time windows

Neural Computation
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Learning to Predict by the Methods of Temporal Differences

Machine Learning

Invariant object recognition in the visual system with novel views of 3D objects

Neural Computation
An Adaptive Hierarchical Model of the Ventral Visual Pathway Implemented on a Mobile Robot

BMCV '02 Proceedings of the Second International Workshop on Biologically Motivated Computer Vision
Learning optimized features for hierarchical models of invariant object recognition

Neural Computation
A neural model for biological movement recognition: a neurophysiologically plausible theory

Optic flow and beyond
Learning Viewpoint Invariant Perceptual Representations from Cluttered Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
2006 Special Issue: Attention in natural scenes: Neurophysiological and computational bases

Neural Networks
Invariant global motion recognition in the dorsal visual system: a unifying theory

Neural Computation
How to Compare Two Quantities? A Computational Model of Flutter Discrimination

Journal of Cognitive Neuroscience
Visual Recognition and Inference Using Dynamic Overcomplete Sparse Learning

Neural Computation
A biologically motivated visual memory architecture for online learning of objects

Neural Networks
Learning transform invariant object recognition in the visual system with multiple stimuli present during training

Neural Networks
Multimodal feedforward self-organizing maps

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Object recognition with statistically independent features: a model inspired by the primate visual cortex

RoboCup 2009
Relative spike time coding and STDP-based orientation selectivity in the early visual system in natural continuous and saccadic vision: a computational model

Journal of Computational Neuroscience

Quantified Score

Hi-index	0.00

Visualization

Abstract

VisNet2 is a model to investigate some aspects of invariant visual object recognition in the primate visual system. It is a four-layer feedforward network with convergence to each part of a layer from a small region of the preceding layer, with competition between the neurons within a layer and with a trace learning rule to help it learn transform invariance. The trace rule is a modified Hebbian rule, which modifies synaptic weights according to both the current firing rates and the firing rates to recently seen stimuli. This enables neurons to learn to respond similarly to the gradually transforming inputs it receives, which over the short term are likely to be about the same object, given the statistics of normal visual inputs. First, we introduce for VisNet2 both single-neuron and multiple-neuron information-theoretic measures of its ability to respond to transformed stimuli. Second, using these measures, we show that quantitatively resetting the trace between stimuli is not necessary for good performance. Third, it is shown that the sigmoid activation functions used in VisNet2, which allow the sparseness of the representation to be controlled, allow good performance when using sparse distributed representations. Fourth, it is shown that VisNet2 operates well with medium-range lateral inhibition with a radius in the same order of size as the region of the preceding layer from which neurons receive inputs. Fifth, in an investigation of different learning rules for learning transform invariance, it is shown that VisNet2 operates better with a trace rule that incorporates in the trace only activity from the preceding presentations of a given stimulus, with no contribution to the trace from the current presentation, and that this is related to temporal difference learning.