Distributing frequency-dependent data stream computations

  • Authors:
  • Sumit Ganguly

  • Affiliations:
  • Indian Institute of Technology, Kanpur, India

  • Venue:
  • CATS '09 Proceedings of the Fifteenth Australasian Symposium on Computing: The Australasian Theory - Volume 94
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data stream computations in domains such as internet applications are often performed in a highly distributed fashion in order to save time. An example is the class of applications that use the Google Mapreduce framework of scalable distributed processing as presented by (Dean & Ghemawat 2004). A basic question here is: what kind of data stream computations admit scalable and efficient distributed algorithms? We show that the class of data stream computations that approximate functions of the frequency vector of the stream can be computed efficiently in a distributed manner.