Handling big dimensions in distributed data warehouses using the DWS technique

  • Authors:
  • Marco Costa;Henrique Madeira

  • Affiliations:
  • Critical Software S.A., Coimbra, Portugal;University of Coimbra, Portugal

  • Venue:
  • Proceedings of the 7th ACM international workshop on Data warehousing and OLAP
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The DWS (Data Warehouse Striping) technique allows the distribution of large data warehouses through a cluster of computers. The data partitioning approach partition the facts tables through all nodes and replicates the dimension tables. The replication of the dimension tables creates a limitation to the applicability of the DWS technique to data warehouses with big dimensions. This paper proposes a strategy to handle large dimensions in a distributed DWS system and evaluates the proposed strategy experimentally. With the proposed strategy the performance speed up and scale up obtained in the DWS technique are not affected by the presence of big dimensions. Furthermore, it extends the scope of the technique to queries that browse big dimensions that can also benefit of the performance increase of the DWS technique.