Convergence and objective functions of some fault/noise-injection-based online learning algorithms for RBF networks

  • Authors:
  • Kevin I.-J. Ho; Chi-Sing Leung; John Sum

  • Affiliations:
  • Department of Computer Science and Communication Engineering, Providence University, Sha-Lu, Taiwan; Department of Electronic Engineering, The City University of Hong Kong, Kowloon, Hong Kong; Institute of Electronic Commerce, National Chung Hsing University, Taichung, Taiwan and Department of Electronic Engineering, The City University of Hong Kong, Kowloon, Hong Kong

  • Venue:
  • IEEE Transactions on Neural Networks
  • Year:
  • 2010

Abstract

In the last two decades, many online fault/noise-injection algorithms have been developed to attain fault-tolerant neural networks. However, little theoretical work on their convergence and objective functions has been reported. This paper studies six common fault/noise-injection-based online learning algorithms for radial basis function (RBF) networks, namely 1) injecting additive input noise, 2) injecting additive/multiplicative weight noise, 3) injecting multiplicative node noise, 4) injecting multiweight faults (random disconnection of weights), 5) injecting multinode faults during training, and 6) weight decay with multinode fault injection. Based on the Gladyshev theorem, we show that these six online algorithms converge almost surely. Moreover, the true objective functions that they minimize are derived. For injecting additive input noise during training, the objective function is identical to that of the Tikhonov regularizer approach. For injecting additive/multiplicative weight noise during training, the objective function is simply the mean square training error; hence, injecting additive/multiplicative weight noise during training cannot improve the fault tolerance of an RBF network. As with injecting additive input noise, the objective functions of the other fault/noise-injection-based online algorithms consist of a mean square error term and a specialized regularization term.
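
For reference, the input-noise claim in the abstract matches the classical small-noise result that training with additive input noise approximates Tikhonov regularization. A schematic form of that objective is shown below in illustrative notation (f(x, w) is the RBF output and sigma_b^2 the injected-noise variance); this is a sketch of the standard result, not the paper's exact expression:

```latex
% Schematic objective for additive input noise injection (illustrative
% notation, not the paper's exact formula):
\[
  \mathcal{L}(w) \;\approx\;
  \underbrace{\mathbb{E}\!\left[(y - f(x,w))^{2}\right]}_{\text{mean square error}}
  \;+\;
  \underbrace{\sigma_b^{2}\,
  \mathbb{E}\!\left[\left\|\nabla_{x} f(x,w)\right\|^{2}\right]}_{\text{Tikhonov-type regularizer}}
\]
```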
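To make the training procedures concrete, here is a minimal Python sketch of the first of the six algorithms: online RBF learning with additive input noise injected into each training sample. Everything here (the centers, width, learning-rate schedule, and noise level) is an illustrative assumption, not the authors' implementation.

```python
import numpy as np

# Minimal sketch (not the authors' code) of online RBF training with
# additive input noise injection. All names and hyperparameters are
# illustrative assumptions.
rng = np.random.default_rng(0)

centers = np.linspace(-1.0, 1.0, 10)   # fixed RBF centers (assumed)
width = 0.2                             # common RBF width (assumed)
w = np.zeros_like(centers)              # output weights learned online

def phi(x):
    """RBF hidden-layer outputs for a scalar input x."""
    return np.exp(-(x - centers) ** 2 / (2.0 * width ** 2))

def target(x):
    return np.sin(np.pi * x)            # toy regression target

sigma_b = 0.05                          # std of the injected input noise
for t in range(1, 20001):
    # Decaying step size satisfying the usual stochastic-approximation
    # conditions (sum lr = inf, sum lr^2 < inf), as in almost-sure
    # convergence analyses of this kind.
    lr = 1.0 / (10.0 + t)
    x = rng.uniform(-1.0, 1.0)          # online training sample
    y = target(x)
    x_noisy = x + sigma_b * rng.standard_normal()  # inject input noise
    h = phi(x_noisy)
    err = y - w @ h
    w += lr * err * h                   # LMS-style online update

# Per the abstract, this procedure effectively minimizes a
# Tikhonov-regularized MSE rather than the plain MSE.
mse = np.mean([(target(x) - w @ phi(x)) ** 2
               for x in np.linspace(-1, 1, 200)])
print("final MSE on clean inputs:", mse)
```

The other five algorithms the paper analyzes differ only in where the perturbation is applied during each update: to the weights, the hidden nodes, or the connections, with or without weight decay.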