Generalized association rule mining using an efficient data structure

  • Authors:
  • Chieh-Ming Wu;Yin-Fu Huang

  • Affiliations:
  • Graduate School of Engineering Science and Technology, National Yunlin University of Science and Technology, 123 University Road, Section 3, Touliu, Yunlin 640, Taiwan, ROC;Graduate School of Engineering Science and Technology, National Yunlin University of Science and Technology, 123 University Road, Section 3, Touliu, Yunlin 640, Taiwan, ROC

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 12.06

Visualization

Abstract

The goal of this paper is to use an efficient data structure to find the generalized association rules between the items at different levels in a taxonomy tree under the assumption that the original frequent itemsets and association rules were generated in advance. The primary challenge of designing an efficient mining algorithm is how to make use of the original frequent itemsets and association rules to directly generate new generalized association rules, rather than rescanning the database. In the paper, we used an efficient data structure called the frequent closed enumeration table (FCET) to store the relevant information. It stores only maximal itemsets, and can be used to derive the information of the subset itemsets in a maximal itemset through a hash function. In the proposed algorithms GMAR and GMFI, we used join methods and/or pruning techniques to generate new generalized association rules. Through several comprehensive experiments, we found that both algorithms are much better than BASIC and Cumulate algorithms also using the efficient data structure (FCET), owing to fewer candidate itemsets generated by GMAR and GMFI. Furthermore, the GMAR algorithm prunes a large amount of irrelevant rules based on the minimum confidence.