Simple Recurrent Networks Learn Context-Free and Context-Sensitive Languages by Counting

  • Author: Paul Rodriguez
  • Affiliation: Department of Cognitive Science, University of California at San Diego, La Jolla, CA 92093, U.S.A.
  • Venue: Neural Computation
  • Year: 2001

Abstract

It has been shown that if a recurrent neural network (RNN) learns to process a regular language, one can extract a finite-state machine (FSM) by treating regions of phase space as FSM states. However, it has also been shown that one can construct an RNN to implement Turing machines by using RNN dynamics as counters. But how does a network learn languages that require counting? Rodriguez, Wiles, and Elman (1999) showed that a simple recurrent network (SRN) can learn to process a simple context-free language (CFL) by counting up and down. This article extends that work, showing a range of language tasks in which an SRN develops solutions that not only count but also copy and store counting information. In one case, the network stores information like an explicit storage mechanism. In other cases, the network stores information more indirectly, in trajectories that are sensitive to slight, context-dependent displacements. In this sense, an SRN can learn analog computation as a set of interdependent counters. This demonstrates how SRNs may serve as an alternative psychological model of language or sequence processing.
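To make the counting setup concrete, here is a minimal sketch of the kind of task the abstract describes: an Elman-style SRN trained on next-symbol prediction over strings of the context-free language a^n b^n. This is not the paper's original implementation; the symbol coding, the use of PyTorch's nn.RNN as the recurrent layer, the hidden size of 5, and the training details are all illustrative assumptions.

```python
# Sketch only: Elman-style SRN on next-symbol prediction for a^n b^n.
# Encoding, hidden size, and training loop are assumptions, not the paper's setup.
import torch
import torch.nn as nn

VOCAB = 3  # 0 = 'a', 1 = 'b', 2 = end-of-string marker (assumed coding)

def make_string(n):
    """One a^n b^n string followed by an end marker, as a tensor of symbol ids."""
    return torch.tensor([0] * n + [1] * n + [2])

class SRN(nn.Module):
    """Simple recurrent network: one-hot input -> recurrent hidden layer -> output logits."""
    def __init__(self, hidden=5):
        super().__init__()
        self.rnn = nn.RNN(VOCAB, hidden, nonlinearity='tanh', batch_first=True)
        self.out = nn.Linear(hidden, VOCAB)

    def forward(self, ids):
        x = torch.nn.functional.one_hot(ids, VOCAB).float().unsqueeze(0)  # (1, T, VOCAB)
        h, _ = self.rnn(x)                                                # (1, T, hidden)
        return self.out(h).squeeze(0)                                     # (T, VOCAB)

model = SRN(hidden=5)
opt = torch.optim.Adam(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for step in range(2000):
    n = torch.randint(1, 11, (1,)).item()   # train on embedding depths 1..10
    s = make_string(n)
    logits = model(s[:-1])                  # predict each next symbol
    loss = loss_fn(logits, s[1:])
    opt.zero_grad(); loss.backward(); opt.step()

# Probe generalization to a string deeper than any seen in training
with torch.no_grad():
    s = make_string(12)
    pred = model(s[:-1]).argmax(dim=1)
    print("target:", s[1:].tolist())
    print("pred:  ", pred.tolist())
```

On such a task, the question of interest in the article is not just whether the trained network predicts the b's and the end marker at the right depths, but how: its analysis examines the hidden-state trajectories, which in the counting solutions move along a direction of state space on each 'a' and back on each 'b', rather than visiting a finite set of FSM-like regions.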