Spurious valleys in the error surface of recurrent networks: analysis and avoidance

  • Authors:
  • Jason Horn;Orlando De Jesús;Martin T. Hagan

  • Affiliations:
  • Agilent Technologies High Frequency Technology Center, Santa Clara, CA;Research Department, Halliburton Energy Services, Dallas, TX;School of Electrical and Computer Engineering, Oklahoma State University, Stillwater, OK

  • Venue:
  • IEEE Transactions on Neural Networks
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper gives a detailed analysis of the error surfaces of certain recurrent networks and explains some difficulties encountered in training recurrent networks. We show that these error surfaces contain many spurious valleys, and we analyze the mechanisms that cause the valleys to appear. We demonstrate that the principle mechanism can be understood through the analysis of the roots of random polynomials. This paper also provides suggestions for improvements in batch training procedures that can help avoid the difficulties caused by spurious valleys, thereby improving training speed and reliability.