A scalable parallel algorithm for training a hierarchical mixture of neural experts

  • Authors:
  • Pablo A. Estévez
  • Hélène Paugam-Moisy
  • Didier Puzenat
  • Manuel Ugarte

  • Affiliations (in author order):
  • Departamento de Ingeniería Eléctrica, Universidad de Chile, Casilla 412-3, Santiago, Chile
  • Institut des Sciences Cognitives, UMR CNRS 5015, 67 boulevard Pinel, F-69675 Bron Cedex, France
  • Institut des Sciences Cognitives, UMR CNRS 5015, 67 boulevard Pinel, F-69675 Bron Cedex, France and Equipe GRIMAAG, Université Antilles-Guyane, Campus de Fouillole, F-97159 Pointe-à-Pitr ...
  • Departamento de Ingeniería Eléctrica, Universidad de Chile, Casilla 412-3, Santiago, Chile

  • Venue:
  • Parallel Computing
  • Year:
  • 2002

Abstract

Efficient parallel learning algorithms are proposed for training a powerful modular neural network, the hierarchical mixture of experts (HME). The parallelizations are based on the concept of modular parallelism, i.e., the parallel execution of network modules. By modeling the speed-up as a function of the number of processors and the number of training examples, several improvements are derived, such as pipelining the training examples in packets. The theoretical models agree closely with experimental measurements. For regular topologies, an analysis of the models shows that the parallel algorithms are highly scalable when the size of the experts grows from linear units to multi-layer perceptrons (MLPs). These results are confirmed experimentally, with near-linear speed-ups achieved for HME-MLP. Although this work can be viewed as a case study in the parallelization of HME neural networks, both the algorithms and the theoretical models can be extended to different learning rules or to less regular tree architectures.
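
The two ideas named in the abstract, modular parallelism (each expert runs as an independent module on its own processor) and pipelining the training examples in packets, can be illustrated with a minimal sketch. The Python code below shows only the forward mixing of a single-level mixture of linear experts, not the authors' HME training algorithm or its learning rules; all names (expert_forward, PACKET_SIZE, mixture_output) and the use of ProcessPoolExecutor are illustrative assumptions.

```python
# Minimal sketch (not the authors' algorithm): experts evaluated as parallel
# modules in worker processes, with the data streamed in fixed-size packets.
import numpy as np
from concurrent.futures import ProcessPoolExecutor

PACKET_SIZE = 64   # illustrative packet size: examples sent per communication step
N_EXPERTS = 4      # illustrative number of expert modules

def expert_forward(weights, x_packet):
    """Forward pass of one linear expert on a packet of examples."""
    return x_packet @ weights                      # (packet, n_outputs)

def gate_forward(gate_weights, x_packet):
    """Softmax gating network producing per-example mixing coefficients."""
    scores = x_packet @ gate_weights               # (packet, n_experts)
    scores -= scores.max(axis=1, keepdims=True)    # numerical stability
    e = np.exp(scores)
    return e / e.sum(axis=1, keepdims=True)

def mixture_output(x_packet, expert_ws, gate_w, executor):
    """Evaluate all experts in parallel (modular parallelism) and mix them."""
    futures = [executor.submit(expert_forward, w, x_packet) for w in expert_ws]
    expert_out = np.stack([f.result() for f in futures], axis=1)  # (packet, experts, out)
    g = gate_forward(gate_w, x_packet)                            # (packet, experts)
    return (g[:, :, None] * expert_out).sum(axis=1)               # (packet, out)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_in, n_out = 8, 2
    X = rng.normal(size=(512, n_in))
    expert_ws = [rng.normal(size=(n_in, n_out)) for _ in range(N_EXPERTS)]
    gate_w = rng.normal(size=(n_in, N_EXPERTS))

    with ProcessPoolExecutor(max_workers=N_EXPERTS) as ex:
        # Pipeline the example set through the expert modules packet by packet.
        for start in range(0, len(X), PACKET_SIZE):
            packet = X[start:start + PACKET_SIZE]
            y = mixture_output(packet, expert_ws, gate_w, ex)
```

In this sketch the packet size plays the role of the tunable granularity the abstract refers to: larger packets amortize communication with the expert modules, smaller packets keep the pipeline full, which is the trade-off a speed-up model in terms of the number of processors and training examples would capture.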