Scalable event matching for overlapping subscriptions in pub/sub systems

Authors:
Zhen Liu; Srinivasan Parthasarathy; Anand Ranganathan; Hao Yang
Affiliations:
IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY;IBM T. J. Watson Research Center, Hawthorne, NY
Venue:
Proceedings of the 2007 inaugural international conference on Distributed event-based systems
Year:
2007

Abstract

Content-based publish/subscribe systems match the content of events against predicates in subscriptions. However, most existing systems support only a limited set of operators, such as comparisons on primitive data types (string, integer, etc.). In this paper, we consider a publish/subscribe system that supports more flexible events and subscriptions through advanced, yet potentially expensive, matching operators. Examples of such operators are pattern recognizers on multimedia data and spatial operators on location data. We study a critical problem in these publish/subscribe systems, namely how to optimize the matching process for a large number of subscriptions. We do so by exploiting the overlap among subscriptions and sharing operator evaluation results whenever possible. We formulate the optimal subscription evaluation problem and show that it is NP-hard. We propose an efficient d-approximation algorithm, where d is the maximum number of operators in one subscription, as well as a heuristic algorithm that can further improve system performance in practice. Our experimental results show that the proposed algorithms can reduce the matching cost by up to 80% compared with a naive strategy that evaluates the subscriptions independently.
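
The idea of sharing operator evaluation results across overlapping subscriptions can be illustrated with a minimal sketch. This is not the paper's d-approximation or heuristic algorithm; it only contrasts independent evaluation with per-event caching of operator results. The operator names, their costs, the event fields, and the assumption that each subscription is a conjunction of operators are all hypothetical and chosen for illustration.

```python
from typing import Callable, Dict, List, Tuple

Event = Dict[str, object]
# Each operator is a (cost, predicate) pair. Costs are made-up units.
Operator = Tuple[float, Callable[[Event], bool]]

# Hypothetical operators: a cheap comparison, a spatial test, and an
# expensive pattern recognizer (all stand-ins, not from the paper).
OPERATORS: Dict[str, Operator] = {
    "in_region": (5.0, lambda e: e["x"] < 10),
    "has_face": (50.0, lambda e: e["image"] == "face"),
    "is_urgent": (1.0, lambda e: e["priority"] > 3),
}

# Assumption: a subscription matches when ALL of its operators hold.
SUBSCRIPTIONS: Dict[str, List[str]] = {
    "s1": ["in_region", "has_face"],
    "s2": ["is_urgent", "has_face"],
    "s3": ["in_region", "is_urgent"],
}


def match_independently(event: Event) -> Tuple[List[str], float]:
    """Naive strategy: every subscription re-evaluates its own operators."""
    total_cost = 0.0
    matched = []
    for sub_id, op_names in SUBSCRIPTIONS.items():
        ok = True
        for name in op_names:
            cost, predicate = OPERATORS[name]
            total_cost += cost  # paid again even if evaluated before
            if not predicate(event):
                ok = False
                break  # short-circuit the conjunction
        if ok:
            matched.append(sub_id)
    return matched, total_cost


def match_with_sharing(event: Event) -> Tuple[List[str], float]:
    """Shared strategy: each operator is evaluated at most once per event."""
    cache: Dict[str, bool] = {}
    total_cost = 0.0
    matched = []
    for sub_id, op_names in SUBSCRIPTIONS.items():
        ok = True
        for name in op_names:
            if name not in cache:
                cost, predicate = OPERATORS[name]
                total_cost += cost  # paid only on the first evaluation
                cache[name] = predicate(event)
            if not cache[name]:
                ok = False
                break
        if ok:
            matched.append(sub_id)
    return matched, total_cost


event = {"x": 3, "image": "face", "priority": 5}
print(match_independently(event))  # (['s1', 's2', 's3'], 112.0)
print(match_with_sharing(event))   # (['s1', 's2', 's3'], 56.0)
```

In this toy setup the cache halves the cost because the expensive `has_face` operator is shared by two subscriptions. The sketch evaluates operators in the order they are listed; choosing a good evaluation order is the harder optimization problem that the abstract describes as NP-hard.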