An Approximate Lp-Difference Algorithm for Massive Data Streams

  • Authors:
  • Jessica H. Fong;Martin Strauss

  • Affiliations:
  • -;-

  • Venue:
  • STACS '00 Proceedings of the 17th Annual Symposium on Theoretical Aspects of Computer Science
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several recent papers have shown how to approximate the difference Σi|ai - bi| or Σ|ai - bi|2 between two functions, when the function values ai and bi are given in a data stream, and their order is chosen by an adversary. These algorithms use little space (much less than would be needed to store the entire stream) and little time to process each item in the stream and give approximations with small relative error. Using different techniques, we show how to approximate the Lp- difference Σi |ai-bi|p for any rational-valued p ∈ (0; 2), with comparable efficiency and error. We also show how to approximate Σi |ai - bi|p for larger values of p but with a worse error guarantee. These results can be used to assess the difference between two chronologically or physically separated massive data sets, making one quick pass over each data set, without buffering the data or requiring the data source to pause.