From local to global patterns: evaluation issues in rule learning algorithms

  • Authors:
  • Johannes Fürnkranz

  • Affiliations:
  • Knowledge Engineering Group, TU Darmstadt, Darmstadt, Germany

  • Venue:
  • LPD'04 Proceedings of the 2004 international conference on Local Pattern Detection
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Separate-and-conquer or covering rule learning algorithms may be viewed as a technique for using local pattern discovery for generating a global theory. Local patterns are learned one at a time, and each pattern is evaluated in a local context, with respect to the number of positive and negative examples that it covers. Global context is provided by removing the examples that are covered by previous patterns before learning a new rule. In this paper, we discuss several research issues that arise in this context. We start with a brief discussion of covering algorithms, their problems, and review a few suggestions for resolving them. We then discuss the suitability of a well-known family of evaluation metrics, and analyze how they trade off coverage and precision of a rule. Our conclusion is that in many applications, coverage is only needed for establishing statistical significance, and that the rule discovery process should focus on optimizing precision. As an alternative to coverage-based overfitting avoidance, we then investigate the feasibility of meta-learning a predictor for the true precision of a rule, based on its coverage on the training set. The results confirm that this is a valid approach, but also point at some shortcomings that need to be addressed in future work.