Incremental aggregation on multiple continuous queries

  • Authors:
  • Chun Jin;Jaime Carbonell

  • Affiliations:
  • Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA;Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

Continuously monitoring large-scale aggregates over data streams is important for many stream processing applications, e.g. collaborative intelligence analysis, and presents new challenges to data management systems. The first challenge is to efficiently generate the updated aggregate values and provide the new results to users after new tuples arrive. We implemented an incremental aggregation mechanism for doing so for arbitrary algebraic aggregate functions including user-defined ones by keeping up-to-date finite data summaries. The second challenge is to construct shared query evaluation plans to support large-scale queries effectively. Since multiple query optimization is NP-complete and the queries generally arrive asynchronously, we apply an incremental sharing approach to obtain the shared plans that perform reasonably well. The system is built as a part of ARGUS, a stream processing system atop of a DBMS. The evaluation study shows that our approaches are effective and efficient on typical collaborative intelligence analysis data and queries.