Parametric Algorithms for Mining Share Frequent Itemsets

  • Authors:
  • Brock Barber;Howard J. Hamilton

  • Affiliations:
  • Department of Computer Science, University of Regina, Regina, SK, Canada S4S 0A2;Department of Computer Science, University of Regina, Regina, SK, Canada S4S 0A2. hamilton@cs.uregina.ca

  • Venue:
  • Journal of Intelligent Information Systems
  • Year:
  • 2001

Abstract

Itemset share, the fraction of some numerical total contributed by items when they occur in itemsets, has been proposed as a measure of the importance of itemsets in association rule mining. The IAB and CAC algorithms are able to find share frequent itemsets that have infrequent subsets. These algorithms perform well, but they do not always find all possible share frequent itemsets. In this paper, we describe the incorporation of a threshold factor into these algorithms. The threshold factor can be used to increase the number of frequent itemsets found at a cost of an increase in the number of infrequent itemsets examined. The modified algorithms are tested on a large commercial database. Their behavior is examined using principles of classifier evaluation from machine learning.
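As an illustration of the share measure described in the abstract, the following is a minimal sketch, not the IAB or CAC algorithms and not the authors' code. It assumes each transaction maps an item to a numeric measure value such as quantity sold, and that the share of an itemset is the measure value contributed by its items in the transactions containing the whole itemset, divided by the total over all transactions. The function names, the `min_share` parameter, and the toy data are hypothetical.

```python
from typing import Dict, Iterable, List

# Assumption: a transaction maps each item to a numeric measure value
# (for example, quantity sold). This layout is illustrative only.
Transaction = Dict[str, float]


def itemset_share(transactions: List[Transaction], itemset: Iterable[str]) -> float:
    """Fraction of the overall measure total contributed by the items of
    `itemset` in the transactions that contain every item of `itemset`."""
    items = frozenset(itemset)
    total = sum(sum(t.values()) for t in transactions)
    if total == 0:
        return 0.0
    contributed = sum(
        sum(t[i] for i in items)
        for t in transactions
        if items.issubset(t.keys())
    )
    return contributed / total


def is_share_frequent(
    transactions: List[Transaction], itemset: Iterable[str], min_share: float
) -> bool:
    """Hypothetical helper: an itemset is share frequent when its share
    reaches the minimum share threshold."""
    return itemset_share(transactions, itemset) >= min_share


# Toy data (made up): the overall measure total is 1 + 5 + 4 = 10.
transactions = [
    {"a": 1, "b": 5},
    {"c": 4},
]

share_a = itemset_share(transactions, ["a"])
share_ab = itemset_share(transactions, ["a", "b"])
```

With a hypothetical minimum share of 0.5 on this toy data, the itemset {a, b} is share frequent while its subset {a} is not, which is the situation the abstract refers to when it mentions share frequent itemsets that have infrequent subsets.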