Ad hoc star join query processing in cluster architectures

  • Authors:
  • Josep Aguilar-Saborit;Victor Muntés-Mulero;Calisto Zuzarte;Josep-L. Larriba-Pey

  • Affiliations:
  • Computer Architecture Department, Universitat Politecnica de Catalunya, Barcelona, Spain;Computer Architecture Department, Universitat Politecnica de Catalunya, Barcelona, Spain;IBM Toronto Lab., Markham, ON, Canada;Computer Architecture Department, Universitat Politecnica de Catalunya, Barcelona, Spain

  • Venue:
  • DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Processing of large amounts of data in data warehouses is increasingly being done in cluster architectures to achieve scalability. In this paper we look into the problem of ad hoc star join query processing in clusters architectures. We propose a new technique, the Star Hash Join (SHJ), which exploits a combination of multiple bit filter strategies in such architectures. SHJ is a generalization of the Pushed Down Bit Filters for clusters. The objectives of the technique are to reduce (i) the amount of data communicated, (ii) the amount of data spilled to disk during the execution of intermediate joins in the query plan, and (iii) amount of memory used by auxiliary data structures such as bit filters.