Efficient calculation of the gauss-newton approximation of the hessian matrix in neural networks

Authors:
Michael Fairbank;Eduardo Alonso
Affiliations:
-;-
Venue:
Neural Computation
Year:
2012

Citing 3
Cited 0

Implementation of Strassen's algorithm for matrix multiplication

Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
Fast curvature matrix-vector products for second-order gradient descent

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Levenberg-Marquardt (LM) learning algorithm is a popular algorithm for training neural networks; however, for large neural networks, it becomes prohibitively expensive in terms of running time and memory requirements. The most time-critical step of the algorithm is the calculation of the Gauss-Newton matrix, which is formed by multiplying two large Jacobian matrices together. We propose a method that uses backpropagation to reduce the time of this matrix-matrix multiplication. This reduces the overall asymptotic running time of the LM algorithm by a factor of the order of the number of output nodes in the neural network.