Subspace Distance-Based Sampling Method for SVM

Authors:
Xiaofei Zhou;Yong Shi
Affiliations:
-;-
Venue:
ICDMW '10 Proceedings of the 2010 IEEE International Conference on Data Mining Workshops
Year:
2010

Citing 0
Cited 1

Theme word subspace method for text document categorization

DM-IKM '12 Proceedings of the Data Mining and Intelligent Knowledge Management Workshop

Quantified Score

Hi-index	0.02

Visualization

Abstract

Support Vector Machine (SVM) is an effective classifier for classification task, but a vital shortcoming of SVM is that it needs huge computation for large scale learning tasks. Sample selection is a feasible strategy to overcome the problem. In order to rapidly reduce training samples without sacrificing recognition accuracy, this paper presents a novel sample selection strategy based on subspace distance, called subspace sample selection. Subspace selection method tries to select boundary samples of each class convex hull by iteratively absorbing the furthest sample to the subspace of chosen samples. This selection method can efficiently represent original training set and support SVM classification. Experimental results also show that our sample selection method can select fewer high quality samples to maintain the recognition accuracy of SVM