Study of scheduling strategies in a dynamic data grid environment

  • Authors:
  • R. A. Dheepak;Shakeb Ali;Shubhashis Sengupta;Anirban Chakrabarti

  • Affiliations:
  • Software Engineering and Technology Laboratory, Infosys Technologies Ltd., Bangalore, India;Software Engineering and Technology Laboratory, Infosys Technologies Ltd., Bangalore, India;Software Engineering and Technology Laboratory, Infosys Technologies Ltd., Bangalore, India;Software Engineering and Technology Laboratory, Infosys Technologies Ltd., Bangalore, India

  • Venue:
  • IWDC'04 Proceedings of the 6th international conference on Distributed Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data grids seek to harness geographically distributed resources for large-scale data-intensive problems. Such problems involve loosely coupled jobs and large data sets mostly distributed geographically. Data grids have found applications in scientific research, in the field of high-energy Physics, Life Sciences etc. The issues that need to be considered in the data grid research area include: resource management including computation management and data management. Computation management include scheduling of jobs, scalability, response time involved in such scheduling, while data management include data replication in selected sited, data movement when required. Therefore, scheduling and replication assumes great importance in a data grid environment. In this paper, we have developed several scheduling strategies based on a developed replication strategy. The scheduling strategies are called Matching based Scheduling (MJS), Cost base Scheduling (CJS) and Latency based Scheduling (LJS). Among these, LJS and CJS perform similarly and MJS performs worse than both of them.