Modeling replication strategies in data grid systems with arbitrary clustered demands

  • Authors:
  • Jianjin Jiang;Guangwen Yang;Dingxing Wang

  • Affiliations:
  • Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China

  • Venue:
  • Proceedings of the 3rd international conference on Scalable information systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper considers the relationship between request distribution and replica distribution in data grid when request exhibits arbitrary clustered demands. We first give formal model of replication strategies in data grid system. Second, we investigate what is optimal way at the objective of minimizing average access latency to replicate data when request exhibits arbitrary clustered demands. We explain why replicas should be replicated uniformly when request is uniformly distributed in a sub grid in the sense of optimal strategy. Then we investigate the relationship between different files in a sub grid. Furthermore, we analyze the case when all sub grids are equal-sized and conclude that when request is uniformly distributed in system, replicas should be uniformly distributed in system too. Finally, we give an optimal strategy when sub grids are not equal-sized and different sub grids exhibit different request clustering patterns. Compared with some popular strategies, the optimal strategy has some advantages of lower wide area network bandwidth requirement and lower average access latency. Simulation results validate the effectiveness of optimal strategy.