Neighbors' distribution property and sample reduction for support vector machines

  • Authors:
  • Fa Zhu;Jian Yang;Ning Ye;Cong Gao;Guobao Li;Tongmin Yin

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

For data pre-processing of SVMs, many scholars tried to find those samples, which would become support vectors. Generally, support vectors locate in the overlap regions, which are between different classes. But overlap region does not always exist. In this paper, a new method is proposed to find the boundary regions of each class instead of overlap regions. This method could deal with the dataset without overlap regions. Summing the cosine of the sample-neighbor angle, the sum ranges from 0 to k. When the sample locates in the boundary region of data distribution, the sum would be close to k; when the sample locates in the interior of the data distribution, the sum would be close to 0. Using cosine sum, the samples locating in the interior of each class can be disposed before SVMs training. Experimental results show that the proposed method can solve the problem, which the methods based on finding overlap regions cannot deal with.