Finding Maximal Frequent Itemsets over Online Data Streams Adaptively

Authors:
Daesu Lee;Wonsuk Lee
Affiliations:
Yonsei University;Yonsei University
Venue:
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Year:
2005

Citing 9
Cited 24

Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Depth first generation of long patterns

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining time-changing data streams

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Mining a stream of transactions for customer patterns

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
The Item-Set Tree: A Data Structure for Data Mining

DaWaK '99 Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery
Approximating a Data Stream for Querying and Estimation: Algorithms and Performance Evaluation

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Finding recent frequent itemsets adaptively over online data streams

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Approximate frequency counts over data streams

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

On-line generation association rules over data streams

Information and Software Technology
Frequent pattern mining for kernel trace data

Proceedings of the 2008 ACM symposium on Applied computing
A survey on algorithms for mining frequent itemsets over data streams

Knowledge and Information Systems
Short communication: TOPSIS: Finding Top-K significant N-itemsets in sliding windows adaptively

Knowledge-Based Systems
Mining frequent items in a stream using flexible windows

Intelligent Data Analysis - Knowledge Discovery from Data Streams
Mining Maximal Frequent Itemsets in Data Streams Based on FP-Tree

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Frequent items in streaming data: An experimental evaluation of the state-of-the-art

Data & Knowledge Engineering
Mining non-derivable frequent itemsets over data stream

Data & Knowledge Engineering
Stream data clustering based on grid density and attraction

ACM Transactions on Knowledge Discovery from Data (TKDD)
A novel hash-based approach for mining frequent itemsets over data streams requiring less memory space

Data Mining and Knowledge Discovery
Online mining of temporal maximal utility itemsets from data streams

Proceedings of the 2010 ACM Symposium on Applied Computing
MMFI_DSSW: a new method to incrementally mine maximal frequent itemsets in transaction sensitive sliding window

KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management
Mining time-delayed associations from discrete event datasets

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
A new algorithm for mining global frequent itemsets in a stream

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
A generic approach for mining indirect association rules in data streams

IEA/AIE'11 Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part I
Quality-driven resource-adaptive data stream mining?

ACM SIGKDD Explorations Newsletter
The augmented itemset tree: a data structure for online maximum frequent pattern mining

DS'11 Proceedings of the 14th international conference on Discovery science
A false negative maximal frequent itemset mining algorithm over stream

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
Efficient algorithms for mining maximal high utility itemsets from data streams with different models

Expert Systems with Applications: An International Journal
Size matters: finding the most informative set of window lengths

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Self-configuring data mining for ubiquitous computing

Information Sciences: an International Journal
Identifying streaming frequent items in ad hoc time windows

Data & Knowledge Engineering
Mining frequent itemsets in a stream

Information Systems
Efficient frequent itemset mining methods over time-sensitive streams

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Due to the characteristics of a data stream, it is very important to confine the memory usage of a data mining process regardless of the amount of information generated in the data stream. For this purpose, this paper proposes a CP-tree (Compressed-prefix tree)that can be effectively used in finding either frequent or maximal frequent itemsets over an online data stream. Unlike a prefix tree, a node of a CP-tree can maintain the information of several itemsets together. Based on this characteristic, the size of a CP-tree can be flexibly controlled by merging or splitting nodes. In this paper, a mining method employing a CP-tree is proposed and an adaptive memory utilization scheme is also presented in order to maximize the mining accuracy of the proposed method for confined memory space at all times. Finally, the performance of the proposed method is analyzed by a series of experiments to identify its various characteristics.