Pruning classification rules with reference vector selection methods

  • Authors:
  • Karol Grudziński;Marek Grochowski;Włodzisław Duch

  • Affiliations:
  • Institute of Physics, Kazimierz Wielki University, Bydgoszcz, Poland;Dept. of Informatics, Nicolaus Copernicus University, Poland;Dept. of Informatics, Nicolaus Copernicus University, Poland

  • Venue:
  • ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Attempts to extract logical rules from data often lead to large sets of classification rules that need to be pruned. Training two classifiers, the C4.5 decision tree and the Non-Nested Generalized Exemplars (NNGE) covering algorithm, on datasets that have been reduced earlier with the EkP instance compressor leads to statistically significantly lower number of derived rules with nonsignificant degradation of results. Similar results have been observed with other popular instance filters used for data pruning. Numerical experiments presented here illustrate that it is possible to extract more interesting and simpler sets of rules from filtered datasets. This enables a better understanding of knowledge structures when data is explored using algorithms that tend to induce a large number of classification rules.