An efficient gradient-based learning algorithm applied to neural networks with selective actuation neurons

  • Authors:
  • Noel Lopes;Bernardete Ribeiro

  • Affiliations:
  • Institute Polytechnic of Guarda, Department of Informatics, Guarda, Portugal and CISUC - Centro de Informática e Sistemas, Department of Informatics Engineering, University of Coimbra, Portug ...;CISUC - Centro de Informática e Sistemas, Department of Informatics Engineering, University of Coimbra, Portugal

  • Venue:
  • Neural, Parallel & Scientific Computations
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

A new class of Neural Networks (NN), designated the Multiple Feed-Forward (MFF) networks, and a new gradient-based learning algorithm, Multiple Back-Propagation (MBP), are proposed and analyzed. MFF are obtained by integrating two feed-forward networks (a main network and a space network) in a novel manner. A major characteristic is their ability to partition the input space by using selective neurons, whose actuation role is captured through the space localisation of input pattern data. In this sense, only those neurons fired by a particular data point turn out to be relevant, while they retain the capacity to approximate closely to more general, irregular, non-linear features in localized regions. Together, the MFF networks and the MBP algorithm embody a new neural architecture, ensuring, in most cases, a better design choice than the one provided by the Multi-Layer Perceptron (MLP) networks trained with the Back-Propagation (BP) algorithm. The utilization of computable importance factors for the actuation neurons whose relative magnitudes are derived from the space network properties and the training data is the key reason for its ability to decompose the underlying mapping function into simpler sub-functions requiring parsimonious NN. Experimental results on benchmarks confirm improved efficiency of the gradient-based learning algorithm proposed, borne out by better generalization and in most cases by shorter training times for online learning, as compared with the MLP networks trained with the BP algorithm.