A bayesian scoring technique for mining predictive and non-spurious rules

  • Authors:
  • Iyad Batal;Gregory Cooper;Milos Hauskrecht

  • Affiliations:
  • Department of Computer Science, University of Pittsburgh;Department of Biomedical Informatics, University of Pittsburgh;Department of Computer Science, University of Pittsburgh

  • Venue:
  • ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Rule mining is an important class of data mining methods for discovering interesting patterns in data. The success of a rule mining method heavily depends on the evaluation function that is used to assess the quality of the rules. In this work, we propose a new rule evaluation score - the Predictive and Non-Spurious Rules (PNSR) score. This score relies on Bayesian inference to evaluate the quality of the rules and considers the structure of the rules to filter out spurious rules. We present an efficient algorithm for finding rules with high PNSR scores. The experiments demonstrate that our method is able to cover and explain the data with a much smaller rule set than existing methods.