SCIS: combining instance selection methods to increase their effectiveness over a wide range of domains

  • Authors:
  • Yoel Caises; Antonio González; Enrique Leyva; Raúl Pérez

  • Affiliations:
  • Facultad de Informática y Matemática, Universidad de Holguín, Cuba; Dpto de Ciencias de la Computación e IA, ETSIIT, Universidad de Granada, España; Facultad de Informática y Matemática, Universidad de Holguín, Cuba; Dpto de Ciencias de la Computación e IA, ETSIIT, Universidad de Granada, España

  • Venue:
  • IDEAL'09 Proceedings of the 10th international conference on Intelligent data engineering and automated learning
  • Year:
  • 2009

Abstract

Instance selection is a feasible strategy for dealing with large databases in inductive learning. Several methods have been proposed in this area, but none of them consistently outperforms the others over a wide range of domains. In this paper we present a set of measures to characterize databases, as well as a new algorithm that uses these measures and, depending on the data characteristics, applies the method or combination of methods expected to produce the best results. This approach was evaluated on 20 databases and with six different learning paradigms. The results were compared with those achieved by five well-known state-of-the-art methods.
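
The general idea of choosing an instance selection method from data characteristics can be illustrated with a minimal sketch. It is not the SCIS algorithm: the abstract does not list the paper's measures, rules, or component methods, so everything below is an assumption made for illustration. The measures in `characterize`, the `size_threshold` rule, and the two selectors (random sampling and Hart's condensed nearest neighbor) are hypothetical stand-ins.

```python
from collections import Counter
import random


def characterize(X, y):
    """Compute simple, hypothetical dataset measures (not the paper's)."""
    counts = Counter(y)
    return {
        "n_instances": len(X),
        "n_classes": len(counts),
        # Ratio of the largest class size to the smallest class size.
        "imbalance": max(counts.values()) / min(counts.values()),
    }


def random_subset(X, fraction=0.5, seed=0):
    """Return sorted indices of a random sample of the instances."""
    rng = random.Random(seed)
    k = max(1, int(len(X) * fraction))
    return sorted(rng.sample(range(len(X)), k))


def condensed_nn(X, y):
    """Hart's condensed nearest neighbor: keep every instance that the
    current store misclassifies under 1-NN, until a full pass adds nothing."""
    def dist2(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    store = [0]
    changed = True
    while changed:
        changed = False
        for i in range(len(X)):
            if i in store:
                continue
            nearest = min(store, key=lambda j: dist2(X[i], X[j]))
            if y[nearest] != y[i]:
                store.append(i)
                changed = True
    return sorted(store)


def select_instances(X, y, size_threshold=1000):
    """Pick a method, or a combination of methods, from the measures.
    The single rule used here is an illustrative assumption."""
    measures = characterize(X, y)
    if measures["n_instances"] > size_threshold:
        # Combination: sample first, then condense the sample.
        sub = random_subset(X)
        kept = condensed_nn([X[i] for i in sub], [y[i] for i in sub])
        return "random+cnn", [sub[j] for j in kept]
    return "cnn", condensed_nn(X, y)


# Two well-separated clusters: one representative per class suffices.
X = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
y = [0, 0, 0, 1, 1, 1]
strategy, kept = select_instances(X, y)
print(strategy, kept)
```

A realistic selector would replace the single threshold with rules learned or tuned over many databases, which is the kind of mapping from measures to methods that the abstract describes.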