In many data streaming applications, tuples in a stream may be revised over time. Streams of this kind raise new issues and challenges for data mining tasks. We present a theoretical analysis of mining frequent itemsets from sliding windows over such data. We define conditions that determine whether an infrequent itemset will become frequent when some existing tuples in the stream are updated. We design simple but effective structures for managing both the evolving tuples and the candidate frequent itemsets. Moreover, we provide a novel verification method that efficiently computes the counts of candidate itemsets. Experiments on real-world datasets show the efficiency and effectiveness of our proposed method.
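To make the setting concrete, the following is a minimal Python sketch of a count-based sliding window whose tuples can be revised in place, with itemset supports recounted by brute force. It is an illustration only and not the structures or the verification method proposed in the paper. The names `RevisableWindow`, `revise`, `support`, and `frequent_itemsets`, the use of tuple ids, and the `max_len` cap are all assumptions introduced here.

```python
from collections import deque
from itertools import combinations


class RevisableWindow:
    """Count-based sliding window whose tuples may be revised (hypothetical)."""

    def __init__(self, size):
        self.size = size
        self.order = deque()  # tuple ids, oldest first
        self.items = {}       # tuple id -> frozenset of items

    def append(self, tid, items):
        """Add a new tuple, evicting the oldest one if the window is full."""
        if len(self.order) == self.size:
            expired = self.order.popleft()
            del self.items[expired]
        self.order.append(tid)
        self.items[tid] = frozenset(items)

    def revise(self, tid, items):
        """Replace the contents of a tuple that is still inside the window."""
        if tid not in self.items:
            return False  # tuple already expired, so the revision is ignored
        self.items[tid] = frozenset(items)
        return True

    def support(self, itemset):
        """Number of tuples in the window that contain every item of itemset."""
        target = frozenset(itemset)
        return sum(1 for t in self.items.values() if target <= t)


def frequent_itemsets(window, min_count, max_len=2):
    """Brute-force recount of all itemsets up to max_len items (assumption:
    a naive baseline, not an efficient verification procedure)."""
    counts = {}
    for items in window.items.values():
        for k in range(1, max_len + 1):
            for combo in combinations(sorted(items), k):
                counts[combo] = counts.get(combo, 0) + 1
    return {c: n for c, n in counts.items() if n >= min_count}
```

In this sketch, revising a tuple that is still in the window can push an itemset's count across the `min_count` threshold without any new tuple arriving, which is the situation the abstract's conditions are meant to characterize. A full recount after every revision is the naive cost that a dedicated verification method would aim to avoid.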