Large Scale Classification with Support Vector Machine Algorithms

Authors:
Thanh-Nghi Do;Jean-Daniel Fekete
Affiliations:
-;-
Venue:
ICMLA '07 Proceedings of the Sixth International Conference on Machine Learning and Applications
Year:
2007

Citing 0
Cited 4

Speed Up SVM Algorithm for Massive Classification Tasks

ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Using text mining and sentiment analysis for online forums hotspot detection and forecast

Decision Support Systems
High dimensional image categorization

ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Inverse matrix-free incremental proximal support vector machine

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Boosting of Least-Squares Support Vector Machine (LS-SVM) algorithms can classify large datasets on standard personal computers (PCs). We extend the LSSVM proposed by Suykens and Vandewalle in several ways to efficiently classify large datasets. We developed a row-incremental version for datasets with billions of data points and up to 10,000 dimensions. By adding a Tikhonov regularization term and using the Sherman-Morrison-Woodbury formula, we developed a column-incremental LS-SVM to process datasets with a small number of data points but very high dimensionality. Finally, by applying boosting to these incremental LS-SVM algorithms, we developed classification algorithms for massive, very-highdimensional datasets, and we also applied these ideas to build boosting of other efficient SVM algorithms proposed by Mangasarian, including Lagrange SVM (LSVM), Proximal SVM (PSVM) and Newton SVM (NSVM). Numerical test results on UCI, RCV1binary,Reuters-21578, Forest cover type and KDD cup 1999 datasets showed that our algorithms are often significantly faster and/or more accurate than state-of- the-art algorithms LibSVM, SVM-perf and CB-SVM.