Toward massive query optimization in large-scale distributed stream systems

  • Authors:
  • Yongluan Zhou;Karl Aberer;Kian-Lee Tan

  • Affiliations:
  • University of Southern Denmark;EPFL, Switzerland;National University of Singapore

  • Venue:
  • Proceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Existing distributed stream systems adopt a tightly-coupled communication paradigm and focus on fine-tuning of operator placements to achieve communication efficiency. This kind of approach is hard to scale (both to the nodes in the network and the users). In this paper, we propose a fundamentally different approach and present the design of a middleware for optimizing massive queries. Our approach takes the advantages of existing Publish/Subscribe systems (Pub/Sub) to achieve loosely-coupled communication and to "intelligently" exploit the sharing of communication among different queries. To fully exploit the capability of a Pub/Sub, we present a new query distribution algorithm, which can adaptively and rapidly (re)distribute the streaming queries at runtime to achieve both load balancing and low communication cost. Both the simulation studies and the prototype experiments executed on Planet-Lab show the effectiveness of our techniques.