InstanceRank based on borders for instance selection

  • Authors:
  • Pablo Hernandez-Leal;J. Ariel Carrasco-Ochoa;J. Fco. MartíNez-Trinidad;J. Arturo Olvera-Lopez

  • Affiliations:
  • National Institute of Astrophysics, Optics and Electronics, Computer Science Department, Luis Enrique Erro No. 1, Sta. María Tonantzintla, Puebla, CP 72840, Mexico;National Institute of Astrophysics, Optics and Electronics, Computer Science Department, Luis Enrique Erro No. 1, Sta. María Tonantzintla, Puebla, CP 72840, Mexico;National Institute of Astrophysics, Optics and Electronics, Computer Science Department, Luis Enrique Erro No. 1, Sta. María Tonantzintla, Puebla, CP 72840, Mexico;Benemérita Universidad Autónoma de Puebla, Computer Science Department, Av. San Claudio y 14 Sur, Ciudad Universitaria, Puebla, CP 72570, Mexico

  • Venue:
  • Pattern Recognition
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Visualization

Abstract

Instance selection algorithms are used for reducing the number of training instances. However, most of them suffer from long runtimes which results in the incapability to be used with large datasets. In this work, we introduce an Instance Ranking per class using Borders (instances near to instances belonging to different classes), using this ranking we propose an instance selection algorithm (IRB). We evaluated the proposed algorithm using k-NN with small and large datasets, comparing it against state of the art instance selection algorithms. In our experiments, for large datasets IRB has the best compromise between time and accuracy. We also tested our algorithm using SVM, LWLR and C4.5 classifiers, in all cases the selection computed by our algorithm obtained the best accuracies in average.