Markovian architectural bias of recurrent neural networks

Authors:
P. Tino;M. Cernansky;L. Benuskova
Affiliations:
Sch. of Comput. Sci., Univ. of Birmingham, UK;-;-
Venue:
IEEE Transactions on Neural Networks
Year:
2004

Citing 0
Cited 34

The Applicability of Recurrent Neural Networks for Biological Sequence Analysis

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Rule Extraction from Recurrent Neural Networks: A Taxonomy and Review

Neural Computation
Dynamics and Topographic Organization of Recursive Self-Organizing Maps

Neural Computation
Organization of the state space of a simple recurrent network before and after training on recursive linguistic structures

Neural Networks
The Crystallizing Substochastic Sequential Machine Extractor: CrySSMEx

Neural Computation
2007 Special Issue: Online design of an echo state network based wide area monitor for a multimachine power system

Neural Networks
Elman Backpropagation as Reinforcement for Simple Recurrent Networks

Neural Computation
State estimation for jumping recurrent neural networks with discrete and distributed delays

Neural Networks
Letters: Dynamics analysis of impulsive stochastic Cohen-Grossberg neural networks with Markovian jumping and mixed time delays

Neurocomputing
On Global Stability of Delayed BAM Stochastic Neural Networks with Markovian Switching

Neural Processing Letters
Stability and synchronization of discrete-time Markovian jumping neural networks with mixed mode-dependent time delays

IEEE Transactions on Neural Networks
A robust extended Elman backpropagation algorithm

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Comparison of echo state networks with simple recurrent networks and variable-length Markov models on symbolic sequences

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Training recurrent connectionist models on symbolic time series

ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Improving the state space organization of untrained recurrent networks

ICONIP'08 Proceedings of the 15th international conference on Advances in neuro-information processing - Volume Part I
Stochastic stability analysis of neutral-type impulsive neural networks with mixed time-varying delays and Markovian jumping

Neurocomputing
Architectural and Markovian factors of echo state networks

Neural Networks
Dynamic background discrimination with a recurrent network

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
On non-markovian topographic organization of receptive fields in recursive self-organizing map

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
Stability analysis for discrete-time Markovian jump neural networks with mixed time-delays

Expert Systems with Applications: An International Journal
Robust stabilization of stochastic Markovian jumping dynamical networks with mixed delays

Neurocomputing
Stability and synchronization for Markovian jump neural networks with partly unknown transition probabilities

Neurocomputing
State Estimation for Discrete-Time Neural Networks with Markov-Mode-Dependent Lower and Upper Bounds on the Distributed Delays

Neural Processing Letters
State estimation of markovian jump neural networks with mixed time delays

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Adaptive stochastic robust convergence of neutral-type neural networks with markovian jump parameters

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
Mean square exponential stability of hybrid neural networks with uncertain switching probabilities

ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Tree Echo State Networks

Neurocomputing
Global exponential estimates of delayed stochastic neural networks with Markovian switching

Neural Networks
H∞ filtering of markovian jumping neural networks with time delays

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part I
A mode-dependent approach to state estimation of recurrent neural networks with Markovian jumping parameters and mixed delays

Neural Networks
Neural networks letter: Stochastic stability of discrete-time Markovian jump delay neural networks with impulses and incomplete information on transition probability

Neural Networks
Effects of leakage time-varying delays in Markovian jump neural networks with impulse control

Neurocomputing
Robust H∞ filter design for uncertain stochastic Markovian jump Hopfield neural networks with mode-dependent time-varying delays

Neurocomputing
pth Moment Exponential Stability of Stochastic Recurrent Neural Networks with Markovian Switching

Neural Processing Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we elaborate upon the claim that clustering in the recurrent layer of recurrent neural networks (RNNs) reflects meaningful information processing states even prior to training. By concentrating on activation clusters in RNNs, while not throwing away the continuous state space network dynamics, we extract predictive models that we call neural prediction machines (NPMs). When RNNs with sigmoid activation functions are initialized with small weights (a common technique in the RNN community), the clusters of recurrent activations emerging prior to training are indeed meaningful and correspond to Markov prediction contexts. In this case, the extracted NPMs correspond to a class of Markov models, called variable memory length Markov models (VLMMs). In order to appreciate how much information has really been induced during the training, the RNN performance should always be compared with that of VLMMs and NPMs extracted before training as the "null" base models. Our arguments are supported by experiments on a chaotic symbolic sequence and a context-free language with a deep recursive structure.