Nonlinear clustering-based support vector machine for large data sets

  • Authors:
  • Yongqiao Wang;Xun Zhang;Souyang Wang;K. K. Lai

  • Affiliations:
  • College of Finance, Zhejiang Gongshang University, Hangzhou, Zhejiang, People's Republic of China;Institute of Systems Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, People's Republic of China;Institute of Systems Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, People's Republic of China;Department of Management Sciences, City University of Hong Kong, Kowloon, Hong Kong SAR, People's Republic of China

  • Venue:
  • Optimization Methods & Software - Mathematical programming in data mining and machine learning
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a kernel clustering-based support vector machine (KCB-SVM) that generalizes the linear clustering-based support vector machine (CB-SVM) to solve nonlinear classification problems in a novel way. It can not only handles large data sets, but can also have nonlinear discriminant power. By introducing kernel clustering, KCB-SVM unifies the metrics in the clustering and training stages. Elaborately designed clustering features summarize all the information required for further clustering and training, which allows only one scan of the total data sets. Experiments on both artificial and real large data sets show that the KCB-SVM not only achieves better classification accuracy than random sampling, active learning and CB-SVM, but also retains the ability to handle large data sets.