Control considerations for scalable event processing

Authors:
Wei Xu;Joseph L. Hellerstein;Bill Kramer;David Patterson
Affiliations:
Computer Science Dept., University of California, Berkeley, CA;IBM T.J. Watson Research Center, Hawthorne, NY;Computer Science Dept., University of California, Berkeley, CA;Computer Science Dept., University of California, Berkeley, CA
Venue:
DSOM'05 Proceedings of the 16th IFIP/IEEE Ambient Networks international conference on Distributed Systems: operations and Management
Year:
2005

Abstract

The growth in the scale of systems and networks has created many challenges for their management, especially for event processing. Our premise is that scaling event processing requires parallelism. To this end, we observe that event processing can be divided into intra-event processing, such as filtering, and inter-event processing, such as root cause analysis. Since intra-event processing is easily parallelized, we propose an architecture in which intra-event processing elements (IAPs) are replicated to scale to larger event input rates. We address two challenges in this architecture. First, the IAPs are subject to overloads that require effective flow control, a capability that was not present in the components we used to build IAPs. Second, we need to balance the loads on IAPs to avoid creating resource bottlenecks. These challenges are further complicated by the presence of disturbances, such as CPU-intensive administrative tasks, that reduce event processing rates. We address these challenges using designs based on control theory, a technique for analyzing stability, accuracy, and settling times. We demonstrate the effectiveness of our approaches with testbed experiments that include a disturbance in the form of a CPU-intensive application.
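
The abstract does not give the controller design, so the following is only a minimal sketch of the general idea of control-theoretic flow control under a disturbance, not the authors' method. It assumes a hypothetical first-order model of a single IAP, y(k+1) = a*y(k) + b*u(k) + d(k), where y is a normalized load measurement, u is the admitted event rate, and d is extra load from a CPU-intensive background task. The function name, the model, and all parameter values (a, b, gain, reference) are illustrative assumptions. An integral controller adjusts u so that y returns to the reference after the disturbance appears.

```python
def simulate_integral_control(steps=120, reference=0.6, gain=0.5,
                              a=0.5, b=0.5, disturbance=0.2,
                              disturbance_start=60):
    """Integral control of an assumed first-order plant.

    Plant (hypothetical):  y(k+1) = a*y(k) + b*u(k) + d(k)
      y: measured IAP load (normalized, assumed measurable)
      u: admitted event rate (normalized control input)
      d: extra load from a CPU-intensive task (the disturbance)
    Controller:            u(k) = u(k-1) + gain * (reference - y(k))
    """
    y, u = 0.0, 0.0
    trace = []
    for k in range(steps):
        error = reference - y
        # Integral law: accumulate the error into the control input.
        u += gain * error
        # The disturbance switches on partway through the run.
        d = disturbance if k >= disturbance_start else 0.0
        # Advance the assumed plant model by one step.
        y = a * y + b * u + d
        trace.append((k, y, u))
    return trace


if __name__ == "__main__":
    for k, y, u in simulate_integral_control()[::10]:
        print(f"step={k:3d}  load={y:.3f}  admitted_rate={u:.3f}")
```

With these assumed values the closed-loop characteristic polynomial is z^2 - 1.25z + 0.5, whose roots have magnitude of about 0.71, so the loop is stable and the load settles back to the reference both before and after the disturbance. The integral term is what removes the steady-state error: once the background task adds load, the controller lowers the admitted rate until the measured load matches the reference again.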