Size matters: finding the most informative set of window lengths
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Hi-index | 0.00 |
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is more likely to be changed as time goes by. Frequent pattern is a kind of data mining techniques discovered knowledge and has been widely studied over the last decade. There are several models and approaches for mining such knowledge, but all previous works only consider a static length of sliding window for mining frequent item sets. We propose a multiple sliding windows for mining frequent patterns on data stream in this paper. The details of study scope are as follows.We propose an efficient discounting method with different lengths of time-sensitive sliding-window. This discounting method doesn't lose the information about Acount and also saves much memory space. Finally, we implement and evaluate the proposed algorithms for mining frequent item sets on data stream.