Error-adaptive and time-aware maintenance of frequency counts over data streams

  • Authors: Hongyan Liu; Ying Lu; Jiawei Han; Jun He

  • Affiliations: Tsinghua University, China; University of Illinois at Urbana-Champaign; University of Illinois at Urbana-Champaign; Renmin University of China, China

  • Venue: WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
  • Year: 2006

Abstract

Maintaining frequency counts for items over data streams has a wide range of applications, such as web advertisement fraud detection. The problem has attracted great attention from both researchers and practitioners, and many algorithms have been proposed. In this paper, we propose a new method, error-adaptive pruning, to maintain frequency counts more accurately. We also propose a method called fractionization that records time information together with the frequency information. Using these two methods, we design three algorithms for finding frequent items and top-k frequent items. Experimental results show that these methods are effective in improving maintenance accuracy.
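
The abstract does not give enough detail to reproduce error-adaptive pruning or fractionization. For context only, the Python sketch below shows Lossy Counting (Manku and Motwani, 2002), a well-known earlier algorithm for the same problem of approximate frequency counting with periodic pruning. It is not the method proposed in this paper, and the class name `LossyCounter` and its methods are illustrative assumptions.

```python
import math


class LossyCounter:
    """Lossy Counting baseline (NOT this paper's error-adaptive method).

    Stored counts undercount true frequencies by at most epsilon * n,
    where n is the number of items seen so far.
    """

    def __init__(self, epsilon):
        if not 0 < epsilon < 1:
            raise ValueError("epsilon must be in (0, 1)")
        self.epsilon = epsilon
        self.width = math.ceil(1 / epsilon)  # items per bucket
        self.n = 0                           # items seen so far
        self.entries = {}                    # item -> [count, max_undercount]

    def add(self, item):
        self.n += 1
        bucket = math.ceil(self.n / self.width)  # id of the current bucket
        entry = self.entries.get(item)
        if entry is not None:
            entry[0] += 1
        else:
            # A new entry may have been pruned earlier; bucket - 1 bounds
            # how many occurrences could have been missed.
            self.entries[item] = [1, bucket - 1]
        if self.n % self.width == 0:
            self._prune(bucket)

    def _prune(self, bucket):
        # At each bucket boundary, drop entries that cannot be frequent.
        self.entries = {
            item: e for item, e in self.entries.items() if e[0] + e[1] > bucket
        }

    def frequent(self, support):
        """Items whose stored count is at least (support - epsilon) * n."""
        threshold = (support - self.epsilon) * self.n
        return {item: e[0] for item, e in self.entries.items() if e[0] >= threshold}

    def top_k(self, k):
        """The k entries with the largest stored counts, as (item, count)."""
        ranked = sorted(self.entries.items(), key=lambda kv: kv[1][0], reverse=True)
        return [(item, e[0]) for item, e in ranked[:k]]
```

With `epsilon=0.1`, a counter fed the 100-item stream `["a", "b", "a", "c"] * 25` keeps all three items, and `frequent(0.4)` reports only `"a"`. Note that the pruning rule here is fixed by the bucket number. The paper's contribution, as stated in the abstract, is a pruning rule that adapts to the error, which this sketch does not attempt to model.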