Rule induction with CN2: some recent improvements
EWSL-91 Proceedings of the European working session on learning on Machine learning
Explora: a multipattern and multistrategy discovery assistant
Advances in knowledge discovery and data mining
Bump hunting in high-dimensional data
Statistics and Computing
Machine Learning
Adaptive Directed Acyclic Graphs for Multiclass Classification
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Rule Evaluation Measures: A Unifying View
ILP '99 Proceedings of the 9th International Workshop on Inductive Logic Programming
Data mining tasks and methods: Subgroup discovery: deviation analysis
Handbook of data mining and knowledge discovery
Subgroup Discovery with CN2-SD
The Journal of Machine Learning Research
Fast Binary Feature Selection with Conditional Mutual Information
The Journal of Machine Learning Research
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Statistical Comparisons of Classifiers over Multiple Data Sets
The Journal of Machine Learning Research
Covering vs divide-and-conquer for top-down induction of logic programs
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
BioDM'06 Proceedings of the 2006 international conference on Data Mining for Biomedical Applications
A comparison of methods for multiclass support vector machines
IEEE Transactions on Neural Networks
The Advantages of Seed Examples in First-Order Multi-class Subgroup Discovery
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
First-Order Multi-class Subgroup Discovery
Proceedings of the 2010 conference on STAIRS 2010: Proceedings of the Fifth Starting AI Researchers' Symposium
Learning multi-class theories in ILP
ILP'10 Proceedings of the 20th international conference on Inductive logic programming
Hi-index | 0.00 |
Subgroup discovery aims at finding subsets of a population whose class distribution is significantly different from the overall distribution. It has previously predominantly been investigated in a two-class context. This paper investigates multi-class subgroup discovery methods. We consider six evaluation measures for multi-class subgroups, four of them new, and study their theoretical properties. We extend the two-class subgroup discovery algorithm CN2-SD to incorporate the new evaluation measures and a new weighting scheme inspired by AdaBoost. We demonstrate the usefulness of multi-class subgroup discovery experimentally, using discovered subgroups as features for a decision tree learner. Not only is the number of leaves of the decision tree reduced with a factor between 8 and 16 on average, but significant improvements in accuracy and AUC are achieved with particular evaluation measures and settings. Similar performance improvements can be observed when using naive Bayes.