Pruning classification rules with reference vector selection methods

Authors:
Karol Grudziński;Marek Grochowski;Włodzisław Duch
Affiliations:
Institute of Physics, Kazimierz Wielki University, Bydgoszcz, Poland;Dept. of Informatics, Nicolaus Copernicus University, Poland;Dept. of Informatics, Nicolaus Copernicus University, Poland
Venue:
ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
Year:
2010

Citing 7
Cited 2

C4.5: programs for machine learning

C4.5: programs for machine learning
Reduction Techniques for Instance-BasedLearning Algorithms

Machine Learning
Advances in Instance Selection for Instance-Based Learning Algorithms

Data Mining and Knowledge Discovery
Generating Accurate Rule Sets Without Global Optimization

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Geometric decision rules for instance-based learning problems

PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
A new methodology of extraction, optimization and application of crisp and fuzzy logical rules

IEEE Transactions on Neural Networks

Simple incremental instance selection wrapper for classification

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
Preceding rule induction with instance reduction methods

MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Attempts to extract logical rules from data often lead to large sets of classification rules that need to be pruned. Training two classifiers, the C4.5 decision tree and the Non-Nested Generalized Exemplars (NNGE) covering algorithm, on datasets that have been reduced earlier with the EkP instance compressor leads to statistically significantly lower number of derived rules with nonsignificant degradation of results. Similar results have been observed with other popular instance filters used for data pruning. Numerical experiments presented here illustrate that it is possible to extract more interesting and simpler sets of rules from filtered datasets. This enables a better understanding of knowledge structures when data is explored using algorithms that tend to induce a large number of classification rules.