From local to global patterns: evaluation issues in rule learning algorithms

Authors: Johannes Fürnkranz
Affiliations: Knowledge Engineering Group, TU Darmstadt, Darmstadt, Germany
Venue: LPD'04 Proceedings of the 2004 international conference on Local Pattern Detection
Year: 2004



Abstract

Separate-and-conquer or covering rule learning algorithms may be viewed as a technique for using local pattern discovery to generate a global theory. Local patterns are learned one at a time, and each pattern is evaluated in a local context, with respect to the number of positive and negative examples that it covers. Global context is provided by removing the examples that are covered by previous patterns before learning a new rule. In this paper, we discuss several research issues that arise in this context. We start with a brief discussion of covering algorithms and their problems, and review a few suggestions for resolving them. We then discuss the suitability of a well-known family of evaluation metrics and analyze how they trade off the coverage and precision of a rule. Our conclusion is that in many applications, coverage is only needed for establishing statistical significance, and that the rule discovery process should focus on optimizing precision. As an alternative to coverage-based overfitting avoidance, we then investigate the feasibility of meta-learning a predictor for the true precision of a rule, based on its coverage on the training set. The results confirm that this is a valid approach, but also point to some shortcomings that need to be addressed in future work.
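
The covering loop described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the attribute-value representation, the greedy refinement, the use of plain precision as the local heuristic, the toy data, and all function names are assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

Example = Tuple[Dict[str, str], bool]  # (attribute values, is_positive)
Rule = Dict[str, str]                  # conjunction of attribute == value tests


def covers(rule: Rule, attrs: Dict[str, str]) -> bool:
    """A rule covers an example if every test in the rule holds."""
    return all(attrs.get(a) == v for a, v in rule.items())


def counts(rule: Rule, examples: List[Example]) -> Tuple[int, int]:
    """Local evaluation: covered positives p and covered negatives n."""
    p = sum(1 for attrs, pos in examples if pos and covers(rule, attrs))
    n = sum(1 for attrs, pos in examples if not pos and covers(rule, attrs))
    return p, n


def precision(p: int, n: int) -> float:
    """Assumed heuristic: fraction of covered examples that are positive."""
    return p / (p + n) if p + n else 0.0


def learn_one_rule(examples: List[Example]) -> Rule:
    """Greedily add the test that most improves precision (ties: coverage)."""
    rule: Rule = {}
    p, n = counts(rule, examples)
    while n > 0:
        # Candidate tests are taken from positives the rule still covers.
        candidates = {(a, v) for attrs, pos in examples
                      if pos and covers(rule, attrs)
                      for a, v in attrs.items() if a not in rule}
        scored = []
        for a, v in candidates:
            cp, cn = counts({**rule, a: v}, examples)
            scored.append((precision(cp, cn), cp, a, v))
        if not scored:
            break
        best_prec, _, a, v = max(scored)
        if best_prec <= precision(p, n):
            break  # no refinement improves the rule
        rule[a] = v
        p, n = counts(rule, examples)
    return rule


def separate_and_conquer(examples: List[Example]) -> List[Rule]:
    """Learn rules one at a time, removing covered examples after each."""
    theory: List[Rule] = []
    remaining = list(examples)
    while any(pos for _, pos in remaining):
        rule = learn_one_rule(remaining)
        if not rule:
            break  # only the empty rule is left; stop
        theory.append(rule)
        # Global context: later rules only see examples not yet covered.
        remaining = [(attrs, pos) for attrs, pos in remaining
                     if not covers(rule, attrs)]
    return theory


# Hypothetical toy data, invented for this sketch.
examples: List[Example] = [
    ({"outlook": "sunny", "windy": "no"}, True),
    ({"outlook": "sunny", "windy": "yes"}, True),
    ({"outlook": "rainy", "windy": "no"}, True),
    ({"outlook": "rainy", "windy": "yes"}, False),
    ({"outlook": "overcast", "windy": "yes"}, False),
]
theory = separate_and_conquer(examples)
```

On this toy data the second rule is learned only from the examples the first rule left uncovered, which is the sense in which example removal supplies global context. Optimizing raw precision on so few examples also shows why coverage-based or other overfitting avoidance is needed in practice.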