A Grid Services-Oriented Architecture for Efficient Operation of Distributed Data Warehouses on Globus

  • Authors:
  • Pascal Wehrle;Maryvonne Miquel;Anne Tchounikine

  • Affiliations:
  • Lyon Research Center for Images and Intelligent Information, France;Lyon Research Center for Images and Intelligent Information, France;Lyon Research Center for Images and Intelligent Information, France

  • Venue:
  • AINA '07 Proceedings of the 21st International Conference on Advanced Networking and Applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data warehouses store large volumes of data according to a multidimensionalmodelthatprovides afastaccess for online analysis. The constant growth in quantity and complexity of data stored in data warehouses has led to a variety of data warehouse applications on distributed systems. Themain benefits of these architectures are parallelized query execution and higher storage capacities. Computing grids in particular are built to combine a large number of heterogeneous distributed resources. Their lack of centralized control however conflicts with the centralized structure of classical data warehouses. Autonomous datamanagement on grid nodes requires efficient communication during query evaluation. The architecture we present supports a global data localization method with the help of a specialized catalog service. Our workis based on a model for uniqueidentification and efficient local indexing of the warehouse data. Local indexes integrate computable aggregates formaximum utilization of locally materialized data in order to facilitate cost-optimized query execution. The grid services implementing these functionalities are deployed on the GGM project's test environment.