Stream operators for querying data streams

  • Authors:
  • Lisha Ma; Stratis D. Viglas; Meng Li; Qian Li

  • Affiliations:
  • School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom; School of Informatics, University of Edinburgh, Edinburgh, United Kingdom; School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom; School of Informatics, University of Edinburgh, Edinburgh, United Kingdom

  • Venue:
  • WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
  • Year:
  • 2005

Abstract

One of the most important uses of aggregate queries over data streams is sampling. Typically, aggregation is performed over sliding windows where queries return new results whenever the window contents change, a concept referred to as a continuous query. Existing data models and query languages for streams are not capable of expressing many practical user-defined sampling schemes over streams. To this end we propose a new data stream model, referred to as the sequence model, and a query language for specifying aggregate queries over data streams. We show that the sequence model can readily express a superset of the aggregate queries expressible in the previously proposed time-based data stream model, thus providing a declarative and formal semantics to understand and reason about continuous aggregate queries. Defined on top of the sequence model, our query language supports existing sliding window operators and a novel frequency operator. Using the frequency operator, one can express useful sampling queries, such as queries with user-defined group-based sampling and nested aggregation over either the input stream or the result stream. Such capabilities are beyond those of previously proposed query languages over streams. Finally, we conduct a preliminary experimental study that shows our language is effective and efficient in practice.
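
To make the idea of a continuous aggregate query over a sliding window concrete, here is a minimal Python sketch. It is not the paper's query language or its sequence model: the function name `sliding_aggregate` and the parameter `every` are hypothetical, and `every` is only an assumed stand-in for a frequency-style control that emits a result on every n-th arrival rather than on every change to the window. The abstract does not define the frequency operator's semantics, so this reading is an illustration only.

```python
from collections import deque
from typing import Callable, Iterable, Iterator, Sequence


def sliding_aggregate(
    stream: Iterable[float],
    window_size: int,
    aggregate: Callable[[Sequence[float]], float],
    every: int = 1,
) -> Iterator[float]:
    """Yield an aggregate over the most recent `window_size` items.

    `every` is a hypothetical parameter (an assumption, not the paper's
    operator): a result is emitted only on every `every`-th arrival.
    With every=1 a result is produced each time the window changes.
    """
    if window_size < 1 or every < 1:
        raise ValueError("window_size and every must be positive")

    # A bounded deque drops the oldest item once the window is full.
    window: deque = deque(maxlen=window_size)

    for position, item in enumerate(stream, start=1):
        window.append(item)
        if position % every == 0:
            yield aggregate(list(window))


readings = [3, 1, 4, 1, 5, 9]

# Continuous query: a new maximum after every arrival.
per_arrival = list(sliding_aggregate(readings, 3, max))

# Sampled query: the same aggregate, reported on every second arrival.
sampled = list(sliding_aggregate(readings, 3, max, every=2))
```

With `every=1` the generator behaves like a plain count-based sliding window; larger values thin out the result stream without changing the window contents, which is one simple way to think about sampling an aggregate's output.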