Optimizing complex sequence pattern extraction using caching

  • Authors:
  • Medhabi Ray;Mo Liu;Elke Rundensteiner;Daniel J. Dougherty;Chetan Gupta;Song Wang;Abhay Mehta;Ismail Ari

  • Affiliations:
  • Computer Science Department, Worcester Polytechnic Institute, USA;Hewlett Packard Innovation Research Laboratories, USA;Computer Science Department, Worcester Polytechnic Institute, USA;Computer Science Department, Worcester Polytechnic Institute, USA;Hewlett Packard Innovation Research Laboratories, USA;Hewlett Packard Innovation Research Laboratories, USA;Hewlett Packard Innovation Research Laboratories, USA;Ozyegin University, Turkey

  • Venue:
  • ICDEW '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering Workshops
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Complex Event Processing (CEP) has become increasingly important for tracking and monitoring complex event anomalies and trends in event streams emitted from business processes such as supply chain management to online stores in e-commerce. These monitoring applications submit complex event queries to track sequences of events that match a given pattern. The state-of-the-art CEP systems mostly focus on the execution of flat sequence queries, we instead support the execution of nested CEP queries specified by our NEsted Event Language NEEL. However, the iterative execution of nested CEP expressions often results in the repeated recomputation of the same or similar results for nested subexpressions as the window slides over the event stream. In this work we thus propose to optimize NEEL execution performance by caching intermediate results. In particular we design two methods of applying selective caching of intermediate results namely Object Caching and the Interval-Driven Semantic Caching. Techniques for incrementally loading, purging and exploiting the cache content are described. Our experimental study using real-world stock trades evaluates the performance of our proposed caching strategies for different query types.