Effective elimination of redundant association rules

  • Authors:
  • James Cheng; Yiping Ke; Wilfred Ng

  • Affiliations:
  • Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong, China (all three authors)

  • Venue:
  • Data Mining and Knowledge Discovery
  • Year:
  • 2008

Abstract

It is well recognized that the main factor hindering the application of Association Rules (ARs) is the huge number of ARs returned by the mining process. In this paper, we propose an effective solution that presents concise mining results by eliminating the redundancy in the set of ARs. We adopt the concept of δ-tolerance to define the set of δ-Tolerance ARs (δ-TARs), which is a concise representation of the set of ARs. The notion of δ-tolerance is a relaxation of the closure defined on the support of frequent itemsets, which allows us to prune redundant ARs effectively. We devise a set of inference rules, with which we prove that the set of δ-TARs is a non-redundant representation of ARs. In addition, we prove that the set of ARs derived from the δ-TARs by the inference rules is sound and complete. We also develop a compact tree structure called the δ-TAR tree, which facilitates the efficient generation of the δ-TARs and the derivation of other ARs. Experimental results verify the efficiency of using the δ-TAR tree to generate the δ-TARs and to query the ARs. The set of δ-TARs is shown to be significantly smaller than the state-of-the-art concise representations of ARs. In addition, the approximations of the support and confidence of the ARs derived from the δ-TARs are highly accurate.
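
To make the terms in the abstract concrete, the following is a minimal sketch of how the support and confidence of an AR are computed, together with one possible reading of a tolerance-relaxed closure test on support. The abstract does not give the formal definition of δ-tolerance, so the function `is_tolerance_closed`, its threshold `(1 - delta) * support`, and the toy transaction data are all assumptions made for illustration and should not be taken as the authors' definition or algorithm.

```python
# Toy transaction database (hypothetical data, for illustration only).
TRANSACTIONS = [
    {"a", "b", "c"},
    {"a", "b", "c"},
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a"},
    {"d"},
]


def support(itemset, transactions=TRANSACTIONS):
    """Count the transactions that contain every item in `itemset`."""
    items = set(itemset)
    return sum(1 for t in transactions if items <= t)


def confidence(antecedent, consequent, transactions=TRANSACTIONS):
    """Confidence of the rule antecedent -> consequent."""
    joint = support(set(antecedent) | set(consequent), transactions)
    return joint / support(antecedent, transactions)


def is_tolerance_closed(itemset, delta, transactions=TRANSACTIONS):
    """ASSUMED relaxation of support-based closure (not taken from the paper).

    With delta = 0 this is the usual closure test: the itemset is closed if
    no one-item extension has the same support. With delta > 0 the itemset
    is treated as absorbed as soon as some one-item extension keeps at
    least (1 - delta) of its support.
    """
    items = set(itemset)
    base = support(items, transactions)
    universe = set().union(*transactions)
    for extra in universe - items:
        if support(items | {extra}, transactions) >= (1 - delta) * base:
            return False
    return True


if __name__ == "__main__":
    print(support({"a", "b"}))                    # 4
    print(confidence({"a", "b"}, {"c"}))          # 0.75
    print(is_tolerance_closed({"a", "b"}, 0.0))   # True
    print(is_tolerance_closed({"a", "b"}, 0.3))   # False
```

In this toy data, {a, b} is closed in the strict sense because adding c lowers its support from 4 to 3, but under the assumed tolerance of 0.3 it is absorbed by {a, b, c}. This illustrates why a relaxed closure can yield fewer itemsets, and therefore fewer rules, at the cost of approximate rather than exact support values.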