The frequent items problem, under polynomial decay, in the streaming model

  • Authors:
  • Guy Feigenblat;Ofra Itzhaki;Ely Porat

  • Affiliations:
  • Department of Computer Science, Bar-Ilan University, Ramat Gan 52900, Israel;Department of Computer Science, Bar-Ilan University, Ramat Gan 52900, Israel;Department of Computer Science, Bar-Ilan University, Ramat Gan 52900, Israel

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2010

Quantified Score

Hi-index 5.23

Visualization

Abstract

We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U. We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O(1@e^2logN(loglogN+logU)) bits, where @e@?(0,1) is the approximation parameter. The second one, probabilistic, uses O(1@e^2logN@dlog1@e) bits or O(1@e^2logN@dlogN) bits, depending on the decay function parameter, where @d@?(0,1) is the probability of failure. The third one, deterministic in the stochastic model, uses O(1@elogU) bits or O(1@e^2logN) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.