A Fast Replica Selection Algorithm for Data Grid

  • Authors:
  • Dafei Yin;Bin Chen;Yu Fang

  • Affiliations:
  • Peking University, Beijing;Peking University, Beijing;Peking University, Beijing

  • Venue:
  • COMPSAC '07 Proceedings of the 31st Annual International Computer Software and Applications Conference - Volume 01
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data Grid, which consists of several geographically distributed datacenters linked by high speed network, is an ideal platform for the data-intensive and computingintensive scientific computing. Besides improving the computing performance and the data processing capabilities, the replication service among the nodes improves failure resistance and increase system availability. Replica selection is one important problem in replication optimization, because Grid application may need to retrieve data from many distributed nodes and do computation on their own local machine in parallel. In this paper, we focus on how to determine an appropriate set of replicas that at least cover the data, and farthest utilize the system parallel computing capacity. As we believe there is a trade-off between increasing parallelism and reducing redundancy as more replicas involved in computation, we put forward a fast replica selection algorithm inspired by the Utility Theory in economics to balance the two conflict demands.