Parallel OLAP query processing in database clusters with data replication

  • Authors:
  • Alexandre A. Lima;Camille Furtado;Patrick Valduriez;Marta Mattoso

  • Affiliations:
  • School of Science and Technology, Unigranrio University, Rio de Janeiro, Brazil;COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil;INRIA, Nantes, France;COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil

  • Venue:
  • Distributed and Parallel Databases
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the problem of improving the performance of OLAP applications in a database cluster (DBC), which is a low cost and effective parallel solution for query processing. Current DBC solutions for OLAP query processing provide for intra-query parallelism only, at the cost of full replication of the database. In this paper, we propose more efficient distributed database design alternatives which combine physical/virtual partitioning with partial replication. We also propose a new load balancing strategy that takes advantage of an adaptive virtual partitioning to redistribute the load to the replicas. Our experimental validation is based on the implementation of our solution on the SmaQSS DBC middleware prototype. Our experimental results using the TPC-H benchmark and a 32-node cluster show very good speedup.