An improved data mining approach using predictive itemsets

  • Authors:
  • Tzung-Pei Hong;Chyan-Yuan Horng;Chih-Hung Wu;Shyue-Liang Wang

  • Affiliations:
  • Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung 811, Taiwan, ROC;Institute of Information Engineering, I-Shou University, Kaohsiung 840, Taiwan, ROC;Department of Electrical Engineering, National University of Kaohsiung, Kaohsiung 811, Taiwan, ROC;Department of Information Management, National University of Kaohsiung, Kaohsiung 811, Taiwan, ROC

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 12.05

Visualization

Abstract

In this paper, we present a mining algorithm to improve the efficiency of finding large itemsets. Based on the concept of prediction proposed in the (n,p) algorithm, our method considers the data dependency in the given transactions to predict promising and non-promising candidate itemsets. Our method estimates for each level a different support threshold that is derived from a data dependency parameter and determines whether an item should be included in a promising candidate itemset directly. In this way, we maintain the efficiency of finding large itemsets by reducing the number of scanning the input dataset and the number candidate items. Experimental results show our method has a better efficiency than the apriori and the (n,p) algorithms when the minimum support value is small.