A model for continuous query latencies in data streams

  • Authors:
  • Roberto Baldoni;Giuseppe Antonio Di Luna;Donatella Firmani;Giorgia Lodi

  • Affiliations:
  • Università degli Studi di Roma "La Sapienza";Università degli Studi di Roma "La Sapienza";Università degli Studi di Roma "La Sapienza";Università degli Studi di Roma "La Sapienza"

  • Venue:
  • Proceedings of the First International Workshop on Algorithms and Models for Distributed Event Processing
  • Year:
  • 2011

Abstract

In this paper we propose a formal model for characterizing the latencies that affect the computation of a continuous query in either a Data Stream Management System (DSMS) or a Complex Event Processing (CEP) system. In the model, a query is built from basic, interconnected Event Processing Units (EPUs). Each EPU is modeled with just a few parameters, which define its processing logic. The continuous query itself is modeled as a directed acyclic (data-flow) graph whose nodes are the EPUs and whose edges represent the flow of information (events) processed by the EPUs. The model associates with each data-flow graph a set of latency metrics, namely reactivity, activity, and output latencies, and a complexity measure, which we call data-flow graph complexity, representing the input dimension required to produce an output event. The proposed model can be used to compare and contrast different data-flow graphs in order to assess their latency metrics. This is a crucial step in selecting the graph that best meets the latency requirements imposed by the programmer before its actual submission to a DSMS or CEP system. Furthermore, the model can be considered an effective means of formally comparing data-flow graphs and predicting their behavior before an actual experimental validation phase.
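
To make the data-flow graph idea concrete, the sketch below represents a query as a directed acyclic graph of EPUs and computes when each EPU would finish processing. It is only an illustration under simplifying assumptions that are not taken from the paper: each EPU is given a single fixed processing latency, an EPU starts once all of its predecessors have finished, and the end-to-end latency is the longest path through the graph. The function name `finish_times`, the EPU names, and the latency values are hypothetical, and this quantity is not the paper's definition of reactivity, activity, or output latency.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def finish_times(edges, latency):
    """Return the time at which each EPU finishes, under the assumptions above.

    edges:   dict mapping an EPU name to the list of EPUs it feeds.
    latency: dict mapping every EPU name to its assumed fixed processing latency.
    """
    # TopologicalSorter expects a node -> predecessors mapping.
    preds = {node: set() for node in latency}
    for src, dsts in edges.items():
        for dst in dsts:
            preds[dst].add(src)

    finish = {}
    # static_order() yields each node only after all of its predecessors.
    for node in TopologicalSorter(preds).static_order():
        # Assumption: an EPU starts when its slowest predecessor has finished.
        start = max((finish[p] for p in preds[node]), default=0.0)
        finish[node] = start + latency[node]
    return finish


# Hypothetical query: two filters feed a join, which feeds an aggregate.
edges = {
    "filter_a": ["join"],
    "filter_b": ["join"],
    "join": ["aggregate"],
}
latency = {"filter_a": 2.0, "filter_b": 1.0, "join": 3.0, "aggregate": 1.5}

times = finish_times(edges, latency)
print(times["aggregate"])  # 6.5, the longest path: filter_a -> join -> aggregate
```

Running the same function on two alternative graphs for the same query gives a rough way to compare them before deployment, which is the kind of comparison the model is meant to support with its own, more precise metrics.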