On the use of surrounding neighbors for synthetic over-sampling of the minority class

  • Authors:
  • V. García;J. S. Sánchez;R. A. Mollineda

  • Affiliations:
  • Dept. Llenguatges i Sistemes Informàtics, Universitat Jaume I, Castelló de la Plana, Spain;Dept. Llenguatges i Sistemes Informàtics, Universitat Jaume I, Castelló de la Plana, Spain;Dept. Llenguatges i Sistemes Informàtics, Universitat Jaume I, Castelló de la Plana, Spain

  • Venue:
  • SMO'08 Proceedings of the 8th conference on Simulation, modelling and optimization
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

It has been observed that class imbalance may produce an important deterioration of the classification accuracy. One of the most popular methods to tackle this problem is the synthetic minority over-sampling technique (SMOTE). From the original SMOTE algorithm, we here propose the use of three surrounding neighborhood approaches with the aim of generating artificial minority examples, but taking both the proximity and the spatial distribution of the examples into account. Experiments with ten real data sets are conducted to compare the models introduced in this paper with SMOTE, demonstrating their effectiveness in a number of problems.