A Scalable Approach to Approximating Aggregate Queries over Intermittent Streams

  • Authors:
  • Shanzhong Zhu;Chinya Ravishankar

  • Affiliations:
  • University of California, Riverside;University of California, Riverside

  • Venue:
  • SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a novel approach to approximate evaluationof standing aggregate queries over streaming data, subjectto user-specified error bounds. Our method models thebehavior of aggregates as Brownian motions, and adaptivelyupdates the model according to stream characteristics.This approach has two advantages. First, it greatly improvessystem scalability since we can defer query evaluationas long as the difference between the returned andtrue aggregate values remains within user-specified bounds.Second, we are able to provide approximate answers duringstream interruptions by estimating the rate at which thestreams and the aggregate drift during the blackout periods.We also study processor allocation issues in such approximateaggregate evaluation systems. Our experiments showthat our model captures the behavior of real-world streamssuch as sensor data and stock traces with excellent fidelity,and scales very well for large numbers of standing queries.