Improving the Learning Speed in 2-Layered LSTM Network by Estimating the Configuration of Hidden Units and Optimizing Weights Initialization

  • Authors:
  • Débora C. Corrêa;Alexandre L. Levada;José H. Saito

  • Affiliations:
  • Computer Department, Federal University of São Carlos, São Paulo, Brazil;Physics Institute of São Carlos, University of São Paulo, São Paulo, Brazil;Computer Department, Federal University of São Carlos, São Paulo, Brazil

  • Venue:
  • ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part I
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a method to initialize the LSTM network weights and estimate the configuration of hidden units in order to improve training time for function approximation tasks. The motivation of this method is based on the behavior of the hidden units and the complexity of the function to be approximated. The results obtained for 1-D and 2-D functions show that the proposed methodology improves the network performance, stabilizing the training phase.