Mining frequent items in a stream using flexible windows

  • Authors:
  • Toon Calders; Nele Dexters; Bart Goethals

  • Affiliations:
  • TU Eindhoven, Eindhoven, The Netherlands; University of Antwerp, Antwerp, Belgium; University of Antwerp, Antwerp, Belgium

  • Venue:
  • Intelligent Data Analysis - Knowledge Discovery from Data Streams
  • Year:
  • 2008

Abstract

We study the problem of finding frequent items in a continuous stream of itemsets. We introduce a new frequency measure based on a flexible window length: the current frequency of an item in the stream is defined as its maximal frequency over all windows that start at any point in the past and end at the current time. We study the properties of this measure and propose an incremental algorithm that can report the current frequency of an item immediately at any time. Experiments show that the memory requirements of the algorithm are extremely small for many different realistic data distributions.
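
To make the definition of the measure concrete, the following is a minimal sketch that computes it by brute force. It is not the incremental algorithm proposed in the paper: it stores the whole stream and rescans it on every query. The function name `max_window_frequency` and the `min_len` parameter (a lower bound on the window length, defaulting to 1) are assumptions introduced here for illustration only.

```python
from fractions import Fraction


def max_window_frequency(stream, item, min_len=1):
    """Brute-force maximal frequency of `item` over all windows that end
    at the most recent itemset and contain at least `min_len` itemsets.

    `stream` is a list of itemsets (any containers supporting `in`),
    ordered from oldest to newest. `min_len` is a hypothetical knob,
    not something stated in the abstract.
    """
    best = Fraction(0)
    count = 0
    # Grow the window backwards from the newest itemset, one step at a time.
    for length, itemset in enumerate(reversed(stream), start=1):
        if item in itemset:
            count += 1
        if length >= min_len:
            # Frequency of `item` in the window made of the last `length` itemsets.
            best = max(best, Fraction(count, length))
    return best


if __name__ == "__main__":
    stream = [{"a"}, {"b"}, {"a", "b"}, {"b"}]
    print(max_window_frequency(stream, "a"))
    print(max_window_frequency(stream, "b"))
```

With no lower bound on the window length, any item contained in the most recent itemset trivially reaches frequency 1 through the window of length one, which is why the sketch exposes `min_len`. Exact fractions are used so that frequencies of different windows compare without rounding error.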