Empirical analysis of support vector machine ensemble classifiers

  • Authors:
  • Shi-jin Wang;Avin Mathew;Yan Chen;Li-feng Xi;Lin Ma;Jay Lee

  • Affiliations:
  • Department of Industrial Engineering and Management, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;Cooperative Research Centre for Integrated Engineering Asset Management (CIEAM), Queensland University of Technology, Brisbane, Australia;NSF Center for Intelligent Maintenance Systems (IMS), University of Cincinnati, Cincinnati, USA;Department of Industrial Engineering and Management, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;Cooperative Research Centre for Integrated Engineering Asset Management (CIEAM), Queensland University of Technology, Brisbane, Australia;NSF Center for Intelligent Maintenance Systems (IMS), University of Cincinnati, Cincinnati, USA

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 12.06

Visualization

Abstract

Ensemble classification - combining the results of a set of base learners - has received much attention in the machine learning community and has demonstrated promising capabilities in improving classification accuracy. Compared with neural network or decision tree ensembles, there is no comprehensive empirical research in support vector machine (SVM) ensembles. To fill this void, this paper analyses and compares SVM ensembles with four different ensemble constructing techniques, namely bagging, AdaBoost, Arc-X4 and a modified AdaBoost. Twenty real-world data sets from the UCI repository are used as benchmarks to evaluate and compare the performance of these SVM ensemble classifiers by their classification accuracy. Different kernel functions and different numbers of base SVM learners are tested in the ensembles. The experimental results show that although SVM ensembles are not always better than a single SVM, the SVM bagged ensemble performs as well or better than other methods with a relatively higher generality, particularly SVMs with a polynomial kernel function. Finally, an industrial case study of gear defect detection is conducted to validate the empirical analysis results.