Mining high utility itemsets in large high dimensional data

  • Authors:
  • Guangzhu Yu;Keqing Li;Shihuang Shao

  • Affiliations:
  • Donghua University, Shanghai, China;Yangtze University, Jingzhou, China;Donghua University, Shanghai, China

  • Venue:
  • Proceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Existing algorithms for utility mining are inadequate on datasets with high dimensions or long patterns. This paper proposes a hybrid method, which is composed of a row enumeration algorithm (i.e., Inter-transaction) and a column enumeration algorithm (i.e., Two-phase), to discover high utility itemsets from two directions: Two-phase seeks short high utility itemsets from the bottom, while Inter-transaction seeks long high utility itemsets from the top. In addition, optimization technique is adopted to improve the performance of computing the intersection of transactions. Experiments on synthetic data show that the hybrid method achieves high performance in large high dimensional datasets.