The anatomy of a stream processing system

Authors:
Altaf Gilani;Satyajeet Sonune;Balakumar Kendai;Sharma Chakravarthy
Affiliations:
Information Technology Laboratory, and Department of Computer Science and Engineering, The University of Texas at Arlington;Information Technology Laboratory, and Department of Computer Science and Engineering, The University of Texas at Arlington;Information Technology Laboratory, and Department of Computer Science and Engineering, The University of Texas at Arlington;Information Technology Laboratory, and Department of Computer Science and Engineering, The University of Texas at Arlington
Venue:
BNCOD'06 Proceedings of the 23rd British National Conference on Databases, conference on Flexible and Efficient Information Handling
Year:
2006

Citing 10
Cited 5

Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
NiagaraCQ: a scalable continuous query system for Internet databases

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Models and issues in data stream systems

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Towards Sensor Database Systems

MDM '01 Proceedings of the Second International Conference on Mobile Data Management
MavHome: An Agent-Based Smart Home

PERCOM '03 Proceedings of the First IEEE International Conference on Pervasive Computing and Communications
Fjording the Stream: An Architecture for Queries Over Streaming Sensor Data

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Aurora: a new model and architecture for data stream management

The VLDB Journal — The International Journal on Very Large Data Bases
Streaming queries over streaming data

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Monitoring streams: a new class of data management applications

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Operator scheduling in a data stream manager

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29

Load Shedding in MavStream: Analysis, Implementation, and Evaluation

BNCOD '08 Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge
Improving data analysis through diverse data source integration

MILCOM'09 Proceedings of the 28th IEEE conference on Military communications
Secure shared continuous query processing

Proceedings of the 2011 ACM Symposium on Applied Computing
Extensions to stream processing architecture for supporting event processing

DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
Multilevel secure data stream processing: Architecture and implementation

Journal of Computer Security - DBSec 2011

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data intensive applications such as network monitoring, financial applications; sensor-based applications etc. need to be supported by general-purpose systems rather than customized implementations. They have a continuous, unpredictable and unbounded flow of data as input, referred as streams. The fact that data comes as a stream with varying input rates (instead of accessing data stored on a disk in a predictable way) and that quality of service (QoS) requirements are stringent for these applications warrants a re-examination of the fundamental architecture of a DBMS. This paper describes the basic processing model and architecture of MavStream – a new Data Stream Management System (DSMS) being developed at UT Arlington. The architecture of MavStream is the primary focus of this paper. The user can give a continuous query from a graphical user interface (GUI), which is instantiated, scheduled, and executed by the MavStream server. We first provide an overview of the basic model and architecture and then describe some of the components of the system. We provide some experimental results to demonstrate the utility of the system and the effect of different scheduling strategies and buffer sizes on the performance and output.