A further study in the data partitioning approach for frequent itemsets mining

  • Authors:
  • Son N. Nguyen;Maria E. Orlowska

  • Affiliations:
  • School of Information Technology and Electrical Engineering, The University of Queensland, QLD, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, QLD, Australia

  • Venue:
  • ADC '06 Proceedings of the 17th Australasian Database Conference - Volume 49
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Frequent itemsets mining is well explored for various data types, and its computational complexity is well understood. Based on our previous work by Nguyen and Orlowska (2005), this paper shows the extension of the data pre-processing approach to further improve the performance of frequent itemsets computation. The methods focus on potential reduction of the size of the input data required for deployment of the partitioning based algorithms.We have made a series of the data pre-processing methods such that the final step of the Partition algorithm, where a combination of all local candidate sets must be processed, is executed on substantially smaller input data. Moreover, we have made a comparison among these methods based on the experiments with particular data sets.