Identifying frequent items in sliding windows over on-line packet streams

  • Authors:
  • Lukasz Golab;David DeHaan;Erik D. Demaine;Alejandro Lopez-Ortiz;J. Ian Munro

  • Affiliations:
  • University of Waterloo;University of Waterloo;M.I.T;University of Waterloo;University of Waterloo

  • Venue:
  • Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Internet traffic patterns are believed to obey the power law, implying that most of the bandwidth is consumed by a small set of heavy users. Hence, queries that return a list of frequently occurring items are important in the analysis of real-time Internet packet streams. While several results exist for computing frequent item queries using limited memory in the infinite stream model, in this paper we consider the limited-memory sliding window model. This model maintains the last $N$ items that have arrived at any given time and forbids the storage of the entire window in memory. We present a deterministic algorithm for identifying frequent items in sliding windows defined over real-time packet streams. The algorithm uses limited memory, requires constant processing time per packet (amortized), makes only one pass over the data, and is shown to work well when tested on TCP traffic logs.