On the Mining of Substitution Rules for Statistically Dependent Items

  • Authors:
  • Wei-Guang Teng;Ming-Jyh Hsieh;Ming-Syan Chen

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a new mining capability, called mining ofsubstitution rules, is explored. A substitution refers to thechoice made by a customer to replace the purchase of someitems with that of others. The process of mining substitutionrules can be decomposed into two procedures. The first procedureis to identify concrete itemsets among a large numberof frequent itemsets, where a concrete itemset is a frequentitemset whose items are statistically dependent. Thesecond procedure is then on the substitution rule generation.Two concrete itemsets X and Y form a substitutionrule, denoted by X \triangleright Y to mean that X is a substitute for Y,if and only if (1) X and Y are negatively correlated and (2)the negative association rule X \to \overline Y exists. In this paper,we derive theoretical properties for the model of substitutionrule mining. Then, in light of these properties, algorithmSRM (standing for substitution rule mining) is designedand implemented to discover the substitution rulesefficiently while attaining good statistical significance. Empiricalstudies are performed to evaluate the performance ofalgorithm SRM proposed. It is shown that algorithm SRMproduces substitution rules of very high quality.