Learning efficiency of redundant neural networks in Bayesian estimation

Authors:
S. Watanabe
Affiliations:
Precision & Intelligence Lab., Tokyo Inst. of Technol., Yokohama
Venue:
IEEE Transactions on Neural Networks
Year:
2001

Citing 0
Cited 4

Algebraic geometrical methods for hierarchical learning machines

Neural Networks
Learning coefficients of layered models when the true distribution mismatches the singularities

Neural Computation
Equations of states in singular statistical estimation

Neural Networks
Learning coefficient of generalization error in bayesian estimation and vandermonde matrix-type singularity

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proves that the Bayesian stochastic complexity of a layered neural network is asymptotically smaller than that of a regular statistical model if it contains the true distribution. We consider a case when a three-layer perceptron with M input units, H hidden units and N output units is trained to estimate the true distribution represented by the model with H0 hidden units and prove that the stochastic complexity is asymptotically smaller than (1/2) {H0 (M+N)+R} log n where n is the number of training samples and R is a function of H-H0, M, and N that is far smaller than the number of redundant parameters. Since the generalization error of Bayesian estimation is equal to the increase of stochastic complexity, it is smaller than (1/2 n) {H0 (M+N)+R} if it has an asymptotic expansion. Based on the results, the difference between layered neural networks and regular statistical models is discussed from the statistical point of view