Enhancing the generalization ability of neural networks through controlling the hidden layers

Authors:
Weishui Wan;Shingo Mabu;Kaoru Shimada;Kotaro Hirasawa;Jinglu Hu
Affiliations:
Graduate School of Information, Production and Systems, Waseda University, Hibikino 2-7, Wakamatsu-ku, Kitakyushu, Fukuoka 808-0135, Japan;Graduate School of Information, Production and Systems, Waseda University, Hibikino 2-7, Wakamatsu-ku, Kitakyushu, Fukuoka 808-0135, Japan;Graduate School of Information, Production and Systems, Waseda University, Hibikino 2-7, Wakamatsu-ku, Kitakyushu, Fukuoka 808-0135, Japan;Graduate School of Information, Production and Systems, Waseda University, Hibikino 2-7, Wakamatsu-ku, Kitakyushu, Fukuoka 808-0135, Japan;Graduate School of Information, Production and Systems, Waseda University, Hibikino 2-7, Wakamatsu-ku, Kitakyushu, Fukuoka 808-0135, Japan
Venue:
Applied Soft Computing
Year:
2009

Citing 19
Cited 4

Comparing biases for minimal network construction with back-propagation

Advances in neural information processing systems 1
The cascade-correlation learning architecture

Advances in neural information processing systems 2
Optimal brain damage

Advances in neural information processing systems 2
A resource-allocating network for function interpolation

Neural Computation
Bayesian interpolation

Neural Computation
A practical Bayesian framework for backpropagation networks

Neural Computation
Simplifying neural networks by soft weight-sharing

Neural Computation
Original Contribution: Improving model selection by nonconvergent methods

Neural Networks
Improving Generalization with Active Learning

Machine Learning - Special issue on structured connectionist systems
Regularization theory and neural networks architectures

Neural Computation
Bayesian regularization and pruning using a Laplace prior

Neural Computation
Structural learning with forgetting

Neural Networks
Convergence suppression and divergence facilitation: minimum and joint use of hidden units by multiple outputs

Neural Networks
Pruning using parameter and neuronal metrics

Neural Computation
An introduction to support Vector Machines: and other kernel-based learning methods

An introduction to support Vector Machines: and other kernel-based learning methods
Neural Networks: A Comprehensive Foundation

Neural Networks: A Comprehensive Foundation
Second Order Derivatives for Network Pruning: Optimal Brain Surgeon

Advances in Neural Information Processing Systems 5, [NIPS Conference]
The lack of a priori distinctions between learning algorithms

Neural Computation
The existence of a priori distinctions between learning algorithms

Neural Computation

Memetic Pareto Evolutionary Artificial Neural Networks to determine growth/no-growth in predictive microbiology

Applied Soft Computing
Review article: A self-organizing map-based initialization for hybrid training of feedforward neural networks

Applied Soft Computing
A Novel Pruning Algorithm for Optimizing Feedforward Neural Network of Classification Problems

Neural Processing Letters
Neural networks letter: Neural architecture design based on extreme learning machine

Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we proposed two new variants of backpropagation algorithm. The common point of these two new algorithms is that the outputs of nodes in the hidden layers are controlled with the aim to solve the moving target problem and the distributed weights problem. One algorithm (AlgoRobust) is not so insensitive to the noises in the data, the second one (AlgoGS) is through using Gauss-Schmidt algorithm to determine in each epoch which weight should be updated, while the other weights are kept unchanged in this epoch. In this way a better generalization can be obtained. Some theoretical explanations are also provided. In addition, simulation comparisons are made between Gaussian regularizer, optimal brain damage (OBD) and the proposed algorithms. Simulation results confirm that the new proposed algorithms perform better than that of Gaussian regularizer, and the first algorithm AlgoRobust performs better than the second algorithm AlgoGS in the noisy data. On the other hand AlgoGS performs better than the AlgoRobust on the data without noise and the final structure obtained by two new algorithms is comparable to that obtained by using OBD.