An integrated query optimization system for data grids

  • Authors:
  • Srikumar Krishnamoorthy;Avdhoot Kishore Saple;Prahalad Haldhoderi Achutharao

  • Affiliations:
  • Infosys Technologies Ltd, Bangalore, India;Infosys Technologies Ltd, Bangalore, India;Infosys Technologies Ltd, Bangalore, India

  • Venue:
  • COMPUTE '08 Proceedings of the 1st Bangalore Annual Compute Conference
  • Year:
  • 2008

Abstract

The disparate, geographically distributed data sources in an enterprise can be integrated using distributed computing technologies such as data grids. The real challenge in such data integration efforts is the design and development of the distributed query processing engine that underlies such integrated systems. In the current literature, distributed query processing and optimization are carried out in three distinct phases: (1) creation of a single-node plan, (2) generation of a parallel plan, and (3) optimal site selection for plan execution. Because considering the three phases in isolation leads to sub-optimal plans, this paper proposes a new distributed query optimization model that integrates all three phases of query optimization. The paper also presents different heuristic approaches for solving the proposed integrated distributed query processing problem. Furthermore, the presented system is integrated with a data grid solution, and several real-time experiments are conducted to demonstrate its usefulness.
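
The claim that optimizing the phases in isolation can yield sub-optimal plans can be illustrated with a small toy sketch. It is not the paper's model or any of its heuristics: the plan names, site names, and cost figures are hypothetical, the parallelization phase is omitted for brevity, and the "integrated" search is a plain exhaustive enumeration over (plan, site) pairs.

```python
from itertools import product

# Hypothetical toy cost model (assumed for illustration, not from the paper).
# LOCAL_COST: processing cost of each candidate single-node plan.
# TRANSFER_COST: data-shipping cost of running a given plan at a given site.
LOCAL_COST = {"plan_a": 10, "plan_b": 12}
TRANSFER_COST = {
    ("plan_a", "site_1"): 9,
    ("plan_a", "site_2"): 8,
    ("plan_b", "site_1"): 2,
    ("plan_b", "site_2"): 7,
}
SITES = ["site_1", "site_2"]


def total_cost(plan, site):
    """Total cost of executing `plan` at `site` under the toy model."""
    return LOCAL_COST[plan] + TRANSFER_COST[(plan, site)]


def phased_optimize():
    """Pick the plan by local cost alone, then pick a site for that fixed plan."""
    plan = min(LOCAL_COST, key=LOCAL_COST.get)
    site = min(SITES, key=lambda s: total_cost(plan, s))
    return plan, site, total_cost(plan, site)


def integrated_optimize():
    """Search plan and site jointly, minimizing the total cost."""
    plan, site = min(product(LOCAL_COST, SITES), key=lambda ps: total_cost(*ps))
    return plan, site, total_cost(plan, site)


if __name__ == "__main__":
    print("phased:    ", phased_optimize())
    print("integrated:", integrated_optimize())
```

Under these assumed costs, the phased search commits to the locally cheapest plan before site costs are known, while the joint search can accept a slightly more expensive plan whose placement is much cheaper. Exhaustive enumeration does not scale beyond tiny search spaces, which is one reason heuristic approaches are of interest for the integrated problem.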