Introduction to the theory of neural computation.
Learning internal representations by error propagation. In: Parallel distributed processing: explorations in the microstructure of cognition, vol. 1.
Distributed Representations, Simple Recurrent Networks, and Grammatical Structure. Machine Learning: Connectionist Approaches to Language Learning.
The Induction of Dynamical Recognizers. Machine Learning: Connectionist Approaches to Language Learning.
Numerical recipes in C (2nd ed.): the art of scientific computing.
Analog computation via neural networks. Theoretical Computer Science.
On the computational power of neural nets. Journal of Computer and System Sciences.
Natural Language Grammatical Inference with Recurrent Neural Networks. IEEE Transactions on Knowledge and Data Engineering.
Rule Extraction from Recurrent Neural Networks: A Taxonomy and Review. Neural Computation.
Introduction to Automata Theory, Languages, and Computation (3rd Edition).
Finite state automata and simple recurrent networks. Neural Computation.
LSTM recurrent networks learn simple context-free and context-sensitive languages. IEEE Transactions on Neural Networks.
A cognitive interactionist sentence parser with simple recurrent networks. Information Sciences: an International Journal.
The performance of a simple recurrent neural network on the implicit acquisition of a context-free grammar is re-examined and found to be significantly higher than previously reported by Elman. This result is obtained even though the previous work employed a multilayer extension of the basic simple recurrent network and restricted the complexity of the training and test corpora. The high performance is traced to a well-organized internal representation of the grammatical elements, as probed by a principal-component analysis of the hidden-layer activities. A capacity for generalization is demonstrated by the next-symbol-prediction performance on sentences not present in the training corpus.
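
To make the setup concrete, the following is a minimal sketch of the two ingredients the abstract relies on: an Elman-style simple recurrent network trained on next-symbol prediction, and a principal-component analysis of its hidden-layer activities. The toy grammar (a^n b^n), the network sizes, the learning rate, and the training schedule are illustrative assumptions, not the configuration used in the paper.

import numpy as np

rng = np.random.default_rng(0)

# Toy corpus: strings from the context-free grammar a^n b^n (n = 1..4),
# a stand-in for the sentence grammar studied in the paper. '#' ends a string.
symbols = ['a', 'b', '#']
idx = {s: i for i, s in enumerate(symbols)}
corpus = [['a'] * n + ['b'] * n + ['#'] for n in range(1, 5)]

def one_hot(s):
    v = np.zeros(len(symbols))
    v[idx[s]] = 1.0
    return v

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Elman-style simple recurrent network: the hidden layer feeds back through
# context units (a copy of the previous hidden state); training uses plain
# backpropagation without unrolling through time, as in Elman's original setup.
n_in = n_out = len(symbols)
n_hid = 10
W_xh = rng.normal(0.0, 0.5, (n_hid, n_in))
W_hh = rng.normal(0.0, 0.5, (n_hid, n_hid))
W_hy = rng.normal(0.0, 0.5, (n_out, n_hid))
b_h = np.zeros(n_hid)
b_y = np.zeros(n_out)

lr = 0.1
for epoch in range(500):
    for sent in corpus:
        h_prev = np.zeros(n_hid)
        for t in range(len(sent) - 1):
            x, target = one_hot(sent[t]), idx[sent[t + 1]]
            h = np.tanh(W_xh @ x + W_hh @ h_prev + b_h)
            y = softmax(W_hy @ h + b_y)
            # Cross-entropy gradient for next-symbol prediction.
            dy = y.copy()
            dy[target] -= 1.0
            dh = (W_hy.T @ dy) * (1.0 - h ** 2)
            W_hy -= lr * np.outer(dy, h)
            b_y -= lr * dy
            W_xh -= lr * np.outer(dh, x)
            W_hh -= lr * np.outer(dh, h_prev)
            b_h -= lr * dh
            h_prev = h  # context units: verbatim copy of the hidden layer

# Probe the internal representation: collect hidden-layer activities over the
# corpus and project them onto the first two principal components (via SVD).
states, labels = [], []
for sent in corpus:
    h_prev = np.zeros(n_hid)
    for t in range(len(sent) - 1):
        h_prev = np.tanh(W_xh @ one_hot(sent[t]) + W_hh @ h_prev + b_h)
        states.append(h_prev)
        labels.append(sent[t])
H = np.array(states)
H -= H.mean(axis=0)
_, _, Vt = np.linalg.svd(H, full_matrices=False)
pcs = H @ Vt[:2].T  # 2-D coordinates of each hidden state
for lab, (p1, p2) in zip(labels, pcs):
    print(f"{lab}: PC1={p1:+.3f}  PC2={p2:+.3f}")

If training succeeds, the projected hidden states should separate by grammatical role (the 'a' phase versus the 'b' phase), with the number of pending 'b' symbols varying systematically along a principal component; this is the kind of well-organized internal representation the abstract's principal-component probe is meant to reveal.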