Neighborhood Preprocessing SVM for Large-Scale Data Sets Classification

Authors:
Guangxi Chen;Jian Xu;Xiaolin Xiang
Affiliations:
-;-;-
Venue:
FSKD '08 Proceedings of the 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 02
Year:
2008

Citing 0
Cited 1

An algorithm for training a large scale support vector machine for regression based on linear programming and decomposition methods

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Support vector machine (SVM) has been a promising method for data mining and machine learning in recent years. However, the training complexity of SVM is highly dependent on the size of a data set. A preprocessing Support Vector Machines (P-SVM) method for large-scale data set classification is presented to speed up SVM training. By analyzing the neighbor classification feature for each sample in training data set, a decision criterion was built to keep or delete this sample from the original data set without losing the classification. The new method can provide an SVM with high quality samples. Experiments with random data and UCI databases show that SVM with our new preprocessing method retains the high quality of training data set and the classification accuracy very well.