An Efficient Algorithm for Maintaining Frequent Closed Itemsets over Data Stream

  • Authors:
  • Show-Jane Yen;Yue-Shi Lee;Cheng-Wei Wu;Chin-Lin Lin

  • Affiliations:
  • Department of Computer Science and Information Engineering, Ming Chuan University, Taoyuan County, Taiwan 333;Department of Computer Science and Information Engineering, Ming Chuan University, Taoyuan County, Taiwan 333;Department of Computer Science and Information Engineering, Ming Chuan University, Taoyuan County, Taiwan 333;Department of Computer Science and Information Engineering, Ming Chuan University, Taoyuan County, Taiwan 333

  • Venue:
  • IEA/AIE '09 Proceedings of the 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: Next-Generation Applied Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data mining refers to the process of revealing unknown and potentially useful information from a large database. Frequent itemsets mining is one of the foundational problems in data mining, which is to discover the set of products that purchased frequently together by customers from a transaction database. However, there may be a large number of patterns generated from database, and many of them are redundant. Frequent closed itemset is a well-known condensed representation of frequent itemset, and it provides complete information of frequent itemsets. Extensive studies have been proposed for mining frequent closed itemsets from transaction database, but most of them do not take streaming data into consideration. In this paper, we propose an efficient algorithm for maintaining frequent closed itemsets over data streams. Whenever a transaction is added to database, our approach incrementally updates the information of closed itemsets and outputs updated frequent closed itemsets based on user-specified thresholds. The experimental results show that our approach outperforms previous studies.