Mixture of experts regression modeling by deterministic annealing

Authors:
A.V. Rao; D. Miller; K. Rose; A. Gersho
Affiliations:
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA
Venue:
IEEE Transactions on Signal Processing
Year:
1997

Abstract

We propose a new learning algorithm for regression modeling. The method is especially suitable for optimizing neural network structures that are amenable to a statistical description as mixture models. These include mixture of experts, hierarchical mixture of experts (HME), and normalized radial basis functions (NRBF). Unlike recent maximum likelihood (ML) approaches, we directly minimize the (squared) regression error. We use the probabilistic framework as a means to define an optimization method that avoids many shallow local minima on the complex cost surface. Our method is based on deterministic annealing (DA), where the entropy of the system is gradually reduced, with the expected regression cost (energy) minimized at each entropy level. The corresponding Lagrangian is the system's “free energy”, and this annealing process is controlled by variation of the Lagrange multiplier, which acts as a “temperature” parameter. The new method consistently and substantially outperformed the competing methods for training NRBF and HME regression functions over a variety of benchmark regression examples.
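The annealing loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy one-dimensional model with NRBF-style softmax gating over constant-output experts, a free energy written as F = D - T*H (D the expected squared error, H the average entropy of the association probabilities), numerical gradients, and a simple geometric cooling schedule. All function names, parameters, and the toy data are hypothetical.

```python
import math

def gate_probs(x, centers, gamma):
    """Softmax (Gibbs) association probabilities p(j|x) for NRBF-style gating."""
    scores = [-gamma * (x - m) ** 2 for m in centers]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]

def cost_and_entropy(params, xs, ys):
    """Return (expected squared error D, average association entropy H).

    params = K centers, then K constant expert outputs, then log(gamma).
    """
    k = (len(params) - 1) // 2
    centers, outputs = params[:k], params[k:2 * k]
    gamma = math.exp(min(params[-1], 50.0))  # clamp to avoid overflow
    d = h = 0.0
    for x, y in zip(xs, ys):
        for p, c in zip(gate_probs(x, centers, gamma), outputs):
            d += p * (y - c) ** 2
            if p > 0.0:
                h -= p * math.log(p)
    return d / len(xs), h / len(xs)

def free_energy(params, xs, ys, temperature):
    """Lagrangian F = D - T * H (assumed form, with T the multiplier)."""
    d, h = cost_and_entropy(params, xs, ys)
    return d - temperature * h

def numeric_grad(f, params, eps=1e-5):
    """Central-difference gradient; a stand-in for analytic gradients."""
    grad = []
    for i in range(len(params)):
        up = params[:i] + [params[i] + eps] + params[i + 1:]
        down = params[:i] + [params[i] - eps] + params[i + 1:]
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad

def descend(f, params, steps=100, step_size=0.5):
    """Gradient descent with backtracking: accept only steps that lower f."""
    value = f(params)
    for _ in range(steps):
        grad = numeric_grad(f, params)
        trial_step = step_size
        while trial_step > 1e-8:
            trial = [p - trial_step * g for p, g in zip(params, grad)]
            trial_value = f(trial)
            if trial_value < value:
                params, value = trial, trial_value
                break
            trial_step *= 0.5
        else:
            break  # no improving step found; treat as converged
    return params

def anneal(xs, ys, params, t_start=1.0, cooling=0.5, stages=8):
    """Minimize F at each temperature, then cool; finish at T = 0 (F = D)."""
    t = t_start
    for _ in range(stages):
        params = descend(lambda p: free_energy(p, xs, ys, t), params)
        t *= cooling
    return descend(lambda p: free_energy(p, xs, ys, 0.0), params)

# Toy data (hypothetical): a noiseless step function sampled on [-1, 1].
xs = [-0.95 + 0.1 * i for i in range(20)]
ys = [0.0 if x < 0 else 1.0 for x in xs]
start = [-0.1, 0.1, 0.4, 0.6, 0.0]  # two centers, two outputs, log(gamma)
fitted = anneal(xs, ys, start)
print("start  (D, H):", cost_and_entropy(start, xs, ys))
print("fitted (D, H):", cost_and_entropy(fitted, xs, ys))
```

At T = 0 the free energy reduces to the regression cost alone, so the final stage is plain squared-error minimization started from the annealed solution. The slightly asymmetric starting point is a design choice of this sketch: it stands in for the small perturbations that a fuller treatment would use to move off symmetric stationary points as the temperature drops.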