Linear-least-squares initialization of multilayer perceptrons through backpropagation of the desired response

  • Authors:
  • D. Erdogmus; O. Fontenla-Romero; J. C. Principe; A. Alonso-Betanzos; E. Castillo

  • Affiliations:
  • Dept. of Comput. Sci. & Eng., Oregon Health Sci. Univ., Portland, OR, USA

  • Venue:
  • IEEE Transactions on Neural Networks
  • Year:
  • 2005

Abstract

Training multilayer neural networks is typically carried out using descent techniques such as the gradient-based backpropagation (BP) of error or quasi-Newton approaches, including the Levenberg-Marquardt algorithm. This is mainly because there are no analytical methods to find the optimal weights, so iterative local or global optimization techniques are necessary. The success of iterative optimization procedures is strictly dependent on the initial conditions; therefore, in this paper, we devise a novel, principled method of backpropagating the desired response through the layers of a multilayer perceptron (MLP), which enables us to accurately initialize these neural networks in the minimum mean-square-error sense using the analytic linear least-squares solution. The generated solution can be used as an initial condition for standard iterative optimization algorithms. However, simulations demonstrate that in most cases, the performance achieved through the proposed initialization scheme leaves little room for further improvement in the mean-square error (MSE) over the training set. In addition, the performance of the network optimized with the proposed approach also generalizes well to testing data. A rigorous derivation of the initialization algorithm is presented, and its high performance is verified on a number of benchmark training problems, including chaotic time-series prediction, classification, and nonlinear system identification with MLPs.
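
To make the idea concrete, the following is a minimal sketch (not the paper's exact derivation) of how a desired response might be backpropagated through a one-hidden-layer MLP and each layer initialized by an analytic least-squares solve. The function name lls_init_mlp, the choice of tanh activations, the clipping thresholds, and the use of small random output weights solely to carry the desired response backwards are all illustrative assumptions.

```python
import numpy as np

def lls_init_mlp(X, D, n_hidden, seed=0):
    """Hypothetical sketch: least-squares initialization of a one-hidden-layer
    tanh MLP via backpropagation of the desired response.

    X : (N, d) input patterns; D : (N, m) desired outputs scaled into (-1, 1).
    Returns W1 of shape (d+1, n_hidden) and W2 of shape (n_hidden+1, m).
    """
    rng = np.random.default_rng(seed)
    N = X.shape[0]
    Xb = np.hstack([X, np.ones((N, 1))])            # bias as an extra constant input

    # Desired pre-activations of the output layer: invert the output nonlinearity.
    Z2 = np.arctanh(np.clip(D, -0.999, 0.999))

    # Small random output weights, used only to carry the desired response backwards.
    W2 = 0.1 * rng.standard_normal((n_hidden + 1, D.shape[1]))

    # Backpropagate the desired response: hidden activations H such that
    # [H, 1] @ W2 approximates Z2 in the least-squares sense, then invert tanh again
    # to obtain desired hidden pre-activations.
    H_des = (Z2 - W2[-1]) @ np.linalg.pinv(W2[:-1])
    Z1 = np.arctanh(np.clip(H_des, -0.999, 0.999))

    # Hidden-layer weights from the analytic linear least-squares solution.
    W1, *_ = np.linalg.lstsq(Xb, Z1, rcond=None)

    # Re-solve the output layer against the actual hidden activations.
    Hb = np.hstack([np.tanh(Xb @ W1), np.ones((N, 1))])
    W2, *_ = np.linalg.lstsq(Hb, Z2, rcond=None)
    return W1, W2
```

In this sketch, the returned W1 and W2 would serve as the initial condition handed to a standard iterative optimizer (e.g., BP or Levenberg-Marquardt), in the spirit of the initialization scheme the abstract describes.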