This article compares three penalty terms with respect to the efficiency of supervised learning, using first- and second-order off-line learning algorithms and a first-order on-line algorithm. Our experiments show that, given a reasonably adequate penalty factor, the combination of the squared penalty term and the second-order learning algorithm converges dramatically faster than the other combinations, while also yielding excellent generalization performance. Moreover, to clarify how the penalty terms differ in their effect, we describe an evaluation of the function surface each one induces. Finally, we show how cross-validation can be applied to find an optimal penalty factor.
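As a rough illustration of the setup summarized above, the sketch below shows batch gradient descent on an objective with a squared penalty term (weight decay), together with selection of the penalty factor by k-fold cross-validation. It uses a plain linear model in NumPy for brevity rather than a multilayer network, and every function and parameter name in it (fit_penalized, cross_validate, lam) is hypothetical; it is not the authors' implementation or their second-order algorithm.

```python
# Minimal sketch (assumptions noted above): squared penalty term plus
# cross-validated choice of the penalty factor, on a linear model.
import numpy as np

def fit_penalized(X, y, lam, lr=0.01, epochs=500):
    """Batch gradient descent on E(w) = ||y - Xw||^2 + (lam/2)||w||^2."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        # Data-fit gradient plus the penalty-term gradient lam * w.
        grad = -2.0 * X.T @ (y - X @ w) + lam * w
        w -= lr * grad / len(y)   # step scaled by sample count
    return w

def cross_validate(X, y, lams, k=5, seed=0):
    """Return the penalty factor with the lowest mean validation error."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    scores = []
    for lam in lams:
        errs = []
        for i in range(k):
            val = folds[i]
            tr = np.concatenate([folds[j] for j in range(k) if j != i])
            w = fit_penalized(X[tr], y[tr], lam)
            errs.append(np.mean((y[val] - X[val] @ w) ** 2))
        scores.append(np.mean(errs))
    return lams[int(np.argmin(scores))]

# Example usage on synthetic data:
# X = np.random.randn(200, 5)
# y = X @ np.array([1.0, 0.0, 2.0, 0.0, -1.0]) + 0.1 * np.random.randn(200)
# best_lam = cross_validate(X, y, lams=[0.0, 0.01, 0.1, 1.0])
```

The squared penalty simply adds lam * w to each weight's gradient, shrinking weights toward zero; the cross-validation loop then picks the penalty factor that generalizes best on held-out folds, which is the role cross-validation plays in the article's final experiment.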