Tight correlated item sets and their efficient discovery

  • Authors:
  • Lizheng Jiang;Dongqing Yang;Shiwei Tang;Xiuli Ma;Dehui Zhang

  • Affiliations:
  • School of Electronics Engineering and Computer Science, Peking University;School of Electronics Engineering and Computer Science, Peking University;School of Electronics Engineering and Computer Science, Peking University and National Laboratory on Machine Perception, Peking University;National Laboratory on Machine Perception, Peking University;National Laboratory on Machine Perception, Peking University

  • Venue:
  • APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study the problem of mining correlated patterns. Correlated patterns have advantages over associations that they cover not only frequent items, but also rare items. Tight correlated item sets is a concise representation of correlated patterns, where items are correlated each other. Although finding such tight correlated item sets is helpful for applications, the algorithm's efficiency is critical, especially for high dimensional database. Thus, we first prove Lemma 1 and Lemma 2 in theory. Utilizing Lemma 1 and Lemma 2, we design an optimized RSC (Regional-Searching-Correlations) algorithm. Furthermore, we estimate the amount of pruned search space for data with various support distributions based on a probabilistic model. Experiment results demonstrate that RSC algorithm is much faster than other similar algorithms.