Hybridizing mixtures of experts with support vector machines: Investigation into nonlinear dynamic systems identification

  • Authors:
  • Clodoaldo A. M. Lima;André L. V. Coelho;Fernando J. Von Zuben

  • Affiliations:
  • Laboratory of Bioinformatics and Bio-inspired Computing (LBiC), Faculty of Electrical and Computer Engineering (FEEC), State University of Campinas (Unicamp), P.O. Box 6101, Zip-code 13083-970 Cam ...;Graduate Program in Applied Informatics (MIA), Center of Technological Sciences (CCT), University of Fortaleza (Unifor), Washington Soares Av. 1321, Bl. J, Zip-code 60811-905 Fortaleza, CE, Brazil;Laboratory of Bioinformatics and Bio-inspired Computing (LBiC), Faculty of Electrical and Computer Engineering (FEEC), State University of Campinas (Unicamp), P.O. Box 6101, Zip-code 13083-970 Cam ...

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2007

Abstract

Mixture of experts (ME) models comprise a family of modular neural network architectures that aim to decompose complex problems into simpler subtasks. This is done by deploying a separate gating module that softly divides the input space into overlapping regions, each of which is assigned to one or more expert networks. Support vector machines (SVMs), in turn, are kernel-based methods, neural-network-like models that constitute an approximate implementation of the structural risk minimization principle. Such learning machines follow the simple but powerful idea of nonlinearly mapping input data into a high-dimensional feature space in which a linear decision surface discriminating between different regions is designed. In this work, we formally characterize and empirically evaluate a novel approach, named Mixture of Support Vector Machine Experts (MSVME), whose main purpose is to combine the complementary properties of SVM and ME models. In the formal characterization, an algorithm based on a maximum likelihood criterion is considered for MSVME training, and we demonstrate that it is possible to train each expert from an SVM perspective. Regarding the empirical evaluation, simulation results involving nonlinear dynamic system identification problems are reported, contrasting the performance of the MSVME approach with that of conventional SVM and ME models.
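
To make the soft division of the input space concrete, the following is a minimal sketch of the forward pass of a gated mixture whose experts are kernel expansions of the kind an SVM produces. It is not the MSVME training algorithm of the paper: the softmax form of the gate, the RBF kernel, the function names, and all parameter values are assumptions chosen for illustration, and no training is performed.

```python
import math

def rbf_kernel(x, z, gamma=1.0):
    """Gaussian (RBF) kernel between two points given as lists of floats."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))

def softmax(scores):
    """Softmax of a list of scores, shifted by the maximum for stability."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gate_weights(x, gate_w, gate_b):
    """Soft assignment of x to each expert: softmax over linear gate scores."""
    scores = [sum(w * xi for w, xi in zip(w_row, x)) + b
              for w_row, b in zip(gate_w, gate_b)]
    return softmax(scores)

def expert_output(x, support, alpha, bias, gamma=1.0):
    """Kernel expansion sum_j alpha_j * k(x, s_j) + bias for one expert."""
    return sum(a * rbf_kernel(x, s, gamma)
               for a, s in zip(alpha, support)) + bias

def mixture_predict(x, gate_w, gate_b, experts, gamma=1.0):
    """Gate-weighted sum of the expert outputs at a single input x."""
    g = gate_weights(x, gate_w, gate_b)
    outs = [expert_output(x, e["support"], e["alpha"], e["bias"], gamma)
            for e in experts]
    y = sum(gi * oi for gi, oi in zip(g, outs))
    return y, g, outs

# Hypothetical, hand-picked parameters for a one-dimensional input:
# the gate favours expert 0 for x > 0 and expert 1 for x < 0.
gate_w = [[4.0], [-4.0]]
gate_b = [0.0, 0.0]
experts = [
    {"support": [[1.0]], "alpha": [2.0], "bias": 0.0},
    {"support": [[-1.0]], "alpha": [-2.0], "bias": 0.0},
]

y, g, outs = mixture_predict([1.0], gate_w, gate_b, experts)
print("gate weights:", g)
print("expert outputs:", outs)
print("mixture output:", y)
```

Because the gate weights are positive and sum to one, the mixture output is a convex combination of the expert outputs, so the regions overlap instead of being separated by hard boundaries.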