Learning and generalization in cascade network architectures

  • Authors:
  • Enno Littmann; Helge Ritter

  • Affiliations:
  • Abt. Neuroinformatik, Fakultät für Informatik, Universität Ulm, D-89069 Ulm, Germany; AG Neuroinformatik, Technische Fakultät, Universität Bielefeld, D-33615 Bielefeld, Germany

  • Venue:
  • Neural Computation
  • Year:
  • 1996

Abstract

Incrementally constructed cascade architectures are a promising alternative to networks of predefined size. This paper compares the direct cascade architecture (DCA) proposed in Littmann and Ritter (1992) to the cascade-correlation approach of Fahlman and Lebiere (1990) and to related approaches, and discusses their properties on the basis of various benchmark results. One important virtue of DCA is that it allows the cascading of entire subnetworks, even if these do not admit error backpropagation. Exploiting this flexibility and using LLM networks as cascaded elements, we show that the performance of the resulting network cascades can be greatly enhanced compared to that of a single network. Our results for the Mackey-Glass time series prediction task indicate that such deeply cascaded network architectures achieve good generalization even on small data sets, whereas shallow, broad architectures of comparable size suffer from overfitting. We conclude that the DCA approach offers a powerful and flexible alternative to existing schemes such as the mixtures-of-experts approach for the construction of modular systems from a wide range of subnetwork types.
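
The sketch below illustrates the general idea described in the abstract, not the authors' exact method: a cascade is built incrementally, each new stage is an independently trained subnetwork that receives the original inputs plus the previous stage's output, it is trained directly on the target, and earlier stages stay frozen so no error signal has to be back-propagated through them. The paper uses LLM networks as the cascaded elements; here small `MLPRegressor` stages stand in as hypothetical subnetworks, and the helper names `build_direct_cascade` and `cascade_predict` are illustrative, not from the paper.

```python
# Minimal sketch of a direct-cascade-style architecture (assumptions noted above):
# each stage sees the raw inputs plus the previous stage's output and is fitted
# to the target on its own, so earlier stages never need gradients passed back.

import numpy as np
from sklearn.neural_network import MLPRegressor


def build_direct_cascade(X, y, n_stages=3, hidden=8, seed=0):
    """Train a cascade of subnetworks one stage at a time (illustrative only)."""
    stages = []
    aug = X                                 # stage 1 sees only the raw inputs
    for k in range(n_stages):
        net = MLPRegressor(hidden_layer_sizes=(hidden,), max_iter=2000,
                           random_state=seed + k)
        net.fit(aug, y)                     # each stage is trained on the target itself
        stages.append(net)
        prev_out = net.predict(aug).reshape(-1, 1)
        aug = np.hstack([X, prev_out])      # next stage: raw inputs + previous output
    return stages


def cascade_predict(stages, X):
    """Evaluate the frozen cascade; the last stage's output is the prediction."""
    aug = X
    out = None
    for net in stages:
        out = net.predict(aug).reshape(-1, 1)
        aug = np.hstack([X, out])
    return out.ravel()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.uniform(-1, 1, size=(200, 2))
    y = np.sin(3 * X[:, 0]) * X[:, 1]       # toy regression target
    stages = build_direct_cascade(X, y)
    print("train MSE:", np.mean((cascade_predict(stages, X) - y) ** 2))
```

Because each stage is fitted in isolation, the scheme works with any subnetwork type that can be trained on its own, which is the flexibility the abstract attributes to DCA; whether a given depth or stage size generalizes well is exactly the kind of question the paper's benchmarks address.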