An approach to QoS aware resource scheduling in data stream systems

  • Authors:
  • Sumit Ganguly;Pushpraj Shukla

  • Affiliations:
  • Indian Institute of Technology, Kanpur, Kanpur, India;University of Texas at Austin, Austin, TX

  • Venue:
  • SSPS '08 Proceedings of the 2nd international workshop on Scalable stream processing system
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a data stream management system (DSMS), there are a large number of simultaneously executing queries, each of which performs a specific monitoring function on some subset of the streaming data. In order to keep up with the rapid arrival rates, a DSMS may sample or drop tuples to match execution rate with the arrival rate. However, as the sampling rate reduces, so does the quality of answers of the queries. Since, there may be substantial overlap between the queries that are simultaneously active in the system, an optimal choice of the points and extent of load-shedding cannot be obtained without taking a "global view" of this problem. In this paper, we formulate the problem of optimizing the rates of sampling with the the overall objective of minimizing the sum of the loss of quality of answers of the queries, assumed to be associated with each query (e.g., as in the Aurora system). Specifically, we consider the problem of obtaining the optimal sampling probabilities for a given collection of sliding-window queries with known loss of quality plots, so that the objective function of the sum of loss of quality across the queries is minimized. The problem can be generally posed and solved as a convex optimization problem and can be solved in low order polynomial time.