Combining multiple neural networks is a promising approach to improving neural network generalisation, since it is very difficult, if not impossible, to develop a single perfect network. To build an aggregated neural network model, a number of individual networks are developed from different data sets and/or different training algorithms. In this paper, individual networks are trained on bootstrap re-samples of the original training and testing data sets. Instead of combining all of the developed networks, this paper proposes two selective combination techniques: forward selection and backward elimination. Both techniques combine only those individual networks that, when aggregated, significantly improve model generalisation. In forward selection, individual networks are added to the aggregated network one at a time until the aggregated network's error on the original training and testing data sets can no longer be reduced. In backward elimination, all of the individual networks are aggregated initially and are then removed one at a time until the aggregated network's error on the original training and testing data sets can no longer be reduced. The proposed techniques are applied to dynamic nonlinear process modelling and to classification of a diabetes database. The application results demonstrate that the proposed techniques significantly improve model generalisation and outperform both aggregating all of the individual networks and the heuristic selective combination method, in which the networks with the best individual performance on the training and testing data are selected.
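The forward selection procedure described above can be sketched as a greedy loop: starting from an empty ensemble, repeatedly add the candidate network whose inclusion most reduces the aggregated (averaged) prediction error, and stop as soon as no addition reduces it further. The sketch below is a minimal illustration under assumed conventions (mean-squared error, simple averaging of member predictions, predictions precomputed as a NumPy array); the paper's networks and error measure on the combined training/testing data would be substituted in practice, and backward elimination is the symmetric procedure starting from the full ensemble.

```python
import numpy as np

def forward_select(preds, y):
    """Greedy forward selection of ensemble members (illustrative sketch).

    preds : array of shape (n_models, n_samples), each row one network's
            predictions on the original training and testing data.
    y     : array of shape (n_samples,), the target values.
    Returns the list of selected model indices and the final ensemble MSE.
    """
    selected = []
    remaining = list(range(preds.shape[0]))
    best_err = np.inf
    while remaining:
        # Trial-add each remaining network and measure the aggregated error.
        errs = []
        for m in remaining:
            agg = preds[selected + [m]].mean(axis=0)  # simple-average combination
            errs.append(np.mean((agg - y) ** 2))
        i = int(np.argmin(errs))
        if errs[i] >= best_err:
            break  # no further error reduction: stop, per the paper's criterion
        best_err = errs[i]
        selected.append(remaining.pop(i))
    return selected, best_err
```

On a toy example with one exact predictor among three candidates, the loop selects only that network and stops, since averaging in either imperfect network would raise the error.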