Convergence of gradient method for a fully recurrent neural network

Authors:
Dongpo Xu; Zhengxue Li; Wei Wu
Affiliations:
Dalian University of Technology, Department of Applied Mathematics, 116024, Dalian, People’s Republic of China (all authors)
Venue:
Soft Computing - A Fusion of Foundations, Methodologies and Applications
Year:
2009

Citing 0
Cited 2

Convergence analysis of three classes of split-complex gradient algorithms for complex-valued recurrent neural networks
Neural Computation

A context layered locally recurrent neural network for dynamic system identification
Engineering Applications of Artificial Intelligence

Abstract

Recurrent neural networks have been used successfully for the analysis and prediction of temporal sequences. This paper is concerned with the convergence of a gradient-descent learning algorithm for training a fully recurrent neural network. In the literature, stochastic process theory has been used to establish convergence results of a probabilistic nature for the on-line gradient training algorithm, under the assumption that a very large number of training samples of the temporal sequences (infinitely many, in theory) are available. In this paper, we consider the case in which only a limited number of training samples of the temporal sequences are available, so that a stochastic treatment of the problem is no longer appropriate. Instead, we use an off-line gradient training algorithm for the fully recurrent neural network and prove corresponding convergence results of a deterministic nature. The monotonicity of the error function over the iterations is also guaranteed. A numerical example is given to support the theoretical findings.
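
To make the off-line (batch) setting concrete, the following is a minimal sketch, not the authors' implementation. It trains a small fully recurrent network on a fixed, finite set of sequences and records the error at every iteration. Everything specific here is an assumption made for illustration: the tanh units, the choice of unit 0 as the output, the zero initial state, the toy data, the step size `eta`, and the use of backpropagation through time to compute the gradient. The abstract does not specify any of these.

```python
import math

def forward(W, U, xs):
    """Run the recurrent net over one input sequence from a zero initial state."""
    n = len(W)
    hs = [[0.0] * n]
    for x in xs:
        prev = hs[-1]
        hs.append([math.tanh(sum(W[i][j] * prev[j] for j in range(n)) + U[i] * x)
                   for i in range(n)])
    return hs

def error_and_grad(W, U, samples):
    """Total squared error over ALL samples and its gradient (off-line setting)."""
    n = len(W)
    E = 0.0
    gW = [[0.0] * n for _ in range(n)]
    gU = [0.0] * n
    for xs, ds in samples:
        hs = forward(W, U, xs)
        dh = [0.0] * n  # gradient flowing back from later time steps
        for t in range(len(xs), 0, -1):
            h, prev = hs[t], hs[t - 1]
            err = h[0] - ds[t - 1]      # unit 0 is read as the output (assumption)
            E += 0.5 * err * err
            dh[0] += err
            da = [dh[i] * (1.0 - h[i] * h[i]) for i in range(n)]  # through tanh
            for i in range(n):
                gU[i] += da[i] * xs[t - 1]
                for j in range(n):
                    gW[i][j] += da[i] * prev[j]
            dh = [sum(W[i][j] * da[i] for i in range(n)) for j in range(n)]
    return E, gW, gU

def train(W, U, samples, eta=0.002, iters=300):
    """Batch gradient descent with a fixed step size; returns weights and error history."""
    W = [row[:] for row in W]
    U = U[:]
    n = len(W)
    errors = []
    for _ in range(iters):
        E, gW, gU = error_and_grad(W, U, samples)
        errors.append(E)
        for i in range(n):
            U[i] -= eta * gU[i]
            for j in range(n):
                W[i][j] -= eta * gW[i][j]
    return W, U, errors

# Toy setup (all values hypothetical): 3 units, 3 short sequences,
# target = half of the previous input.
n = 3
W0 = [[0.1 * ((i + 2 * j) % 3 - 1) for j in range(n)] for i in range(n)]
U0 = [0.1 * (i + 1) for i in range(n)]
samples = []
for s in range(3):
    xs = [math.sin(0.5 * t + s) for t in range(6)]
    ds = [0.0] + [0.5 * x for x in xs[:-1]]
    samples.append((xs, ds))

W, U, errors = train(W0, U0, samples)
print("initial error:", errors[0], "final error:", errors[-1])
```

Because each update uses the gradient of the full error over the fixed sample set, the error history in `errors` can be inspected directly for monotone behaviour. With a sufficiently small step size it should not increase from one iteration to the next, which is the kind of deterministic property the abstract refers to.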