A new constructive algorithm for architectural and functional adaptation of artificial neural networks

Authors:
Md. Monirul Islam;Md. Abdus Sattar;Md. Faijul Amin;Xin Yao;Kazuyuki Murase
Affiliations:
Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh and Department of Human and Artificial Intelligence Systems, Graduate School ...;Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh;Department of Human and Artificial Intelligence Systems, University of Fukui, Fukui, Japan;Center of Excellence for Research in Computational Intelligence and Applications, School of Computer Science, University of Birmingham, Birmingham, UK;Department of Human and Artificial Intelligence Systems, Graduate School of Engineering, University of Fukui, Fukui, Japan and Research and Education Program for Life Science, University of Fukui, ...
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2009

Citing 35
Cited 9

Future paths for integer programming and links to artificial intelligence

Computers and Operations Research - Special issue: Applications of integer programming
Multilayer feedforward networks are universal approximators

Neural Networks
The Strength of Weak Learnability

Machine Learning
Learning internal representations by error propagation

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
The cascade-correlation learning architecture

Advances in neural information processing systems 2
Optimal brain damage

Advances in neural information processing systems 2
Approximation capabilities of multilayer feedforward networks

Neural Networks
A quantitative study of experimental evaluations of neural network learning algorithms: current research practice

Neural Networks
Automatic early stopping using cross validation: quantifying the criteria

Neural Networks
A pruning method for the recursive least squared algorithm

Neural Networks
A new algorithm to design compact two-hidden-layer artificial neural networks

Neural Networks
Second Order Derivatives for Network Pruning: Optimal Brain Surgeon

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Artificial Intelligence: A Modern Approach

Artificial Intelligence: A Modern Approach
An introduction to ROC analysis

Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
A fast learning algorithm for deep belief nets

Neural Computation
Neural Network Theory

Neural Network Theory
An empirical evaluation of deep architectures on problems with many factors of variation

Proceedings of the 24th international conference on Machine learning
A new adaptive merging and growing algorithm for designing artificial neural networks

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Regularization parameter estimation for feedforward neural networks

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Evolutionary neural networks for anomaly detection based on the behavior of a program

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
An Ensemble-Based Incremental Learning Approach to Data Fusion

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Bagging and Boosting Negatively Correlated Neural Networks

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
The sample complexity of pattern classification with neural networks: the size of the weights is more important than the size of the network

IEEE Transactions on Information Theory
Capabilities of a four-layered feedforward neural network: four layers versus three

IEEE Transactions on Neural Networks
Constructive algorithms for structure learning in feedforward neural networks for regression problems

IEEE Transactions on Neural Networks
A new evolutionary system for evolving artificial neural networks

IEEE Transactions on Neural Networks
Objective functions for training new hidden units in constructive neural networks

IEEE Transactions on Neural Networks
Constructive neural-network learning algorithms for pattern classification

IEEE Transactions on Neural Networks
Modified cascade-correlation learning for classification

IEEE Transactions on Neural Networks
A new pruning heuristic based on variance analysis of sensitivity information

IEEE Transactions on Neural Networks
Neural-network construction and selection in nonlinear modeling

IEEE Transactions on Neural Networks
Constructive feedforward neural networks using Hermite polynomial activation functions

IEEE Transactions on Neural Networks
A node pruning algorithm based on a Fourier amplitude sensitivity test method

IEEE Transactions on Neural Networks
An Optimization Methodology for Neural Network Weights and Architectures

IEEE Transactions on Neural Networks
Use of a quasi-Newton method in a feedforward neural network construction algorithm

IEEE Transactions on Neural Networks

An efficient collaborative recommender system based on k-separability

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part III
A parallel evolving algorithm for flexible neural tree

Parallel Computing
A hybrid neural network model based reinforcement learning agent

ISNN'10 Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part I
A self learning rough fuzzy neural network classifier for mining temporal patterns

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Privacy-preserving back-propagation and extreme learning machine algorithms

Data & Knowledge Engineering
Swarm optimization and Flexible Neural Tree for microarray data classification

Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
A new constructive neural network method for noise processing and its application on stock market prediction

Applied Soft Computing
An accuracy-oriented self-splitting fuzzy classifier with support vector learning in high-order expanded consequent space

Applied Soft Computing
Comparing large number of metaheuristics for artificial neural networks training to predict water temperature in a natural river

Computers & Geosciences

Quantified Score

Hi-index	0.00

Visualization

Abstract

The generalization ability of artificial neural networks (ANNs) is greatly dependent on their architectures. Constructive algorithms provide an attractive automatic way of determining a near-optimal ANN architecture for a given problem. Several such algorithms have been proposed in the literature and shown their effectiveness. This paper presents a new constructive algorithm (NCA) in automatically determining ANN architectures. Unlike most previous studies on determining ANN architectures, NCA puts emphasis on architectural adaptation and functional adaptation in its architecture determination process. It uses a constructive approach to determine the number of hidden layers in an ANN and of neurons in each hidden layer. To achieve functional adaptation, NCA trains hidden neurons in the ANN by using different training sets that were created by employing a similar concept used in the boosting algorithm. The purpose of using different training sets is to encourage hidden neurons to learn different parts or aspects of the training data so that the ANN can learn the whole training data in a better way. In this paper, the convergence and computational issues of NCA are analytically studied. The computational complexity of NCA is found to be O(W × Pt × τ), where W is the number of weights in the ANN, Pt is the number of training examples, and τ is the number of training epochs. This complexity has the same order as what the backpropagation learning algorithm requires for training a fixed ANN architecture. A set of eight classification and two approximation benchmark problems was used to evaluate the performance of NCA. The experimental results show that NCA can produce ANN architectures with fewer hidden neurons and better generalization ability compared to existing constructive and nonconstructive algorithms.