A prediction-based parallel replication algorithm in distributed storage system

  • Authors:
  • Yijie Wang;Xiaoming Zhang

  • Affiliations:
  • National Laboratory for Parallel and Distributed Processing, Institute of Computer, National University of Defense Technology, Changsha, China;National Laboratory for Parallel and Distributed Processing, Institute of Computer, National University of Defense Technology, Changsha, China

  • Venue:
  • GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
  • Year:
  • 2005

Abstract

Data replication can reduce bandwidth consumption and access latency in distributed systems where users require remote access to large data objects. Based on the intrinsic characteristics of distributed storage systems, this paper proposes PPR, a prediction-based parallel replication algorithm. PPR uses the characteristics of spatial data to predict which data will be accessed next and prefetches it. During replication, it selects, according to the network state, the several replicas of a data object that have the least access cost; different parts of the data object are then transferred from these replicas and assembled into a new replica. Performance evaluation shows that PPR uses network bandwidth efficiently, provides high replication efficiency and substantially better access efficiency, and effectively avoids interference between different replications.
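The replication step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the cost model or how the object is partitioned, so the `Replica` fields, the `access_cost` formula (latency plus size divided by bandwidth), and the bandwidth-proportional split in `plan_transfer` are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    """Hypothetical description of one existing replica of a data object."""
    site: str
    bandwidth: float  # assumed bytes per second from this site to the requester
    latency: float    # assumed round-trip latency in seconds


def access_cost(replica: Replica, size: int) -> float:
    # Assumed cost model: latency plus time to move the whole object.
    return replica.latency + size / replica.bandwidth


def select_replicas(replicas: list[Replica], size: int, k: int) -> list[Replica]:
    # Keep the k replicas with the least access cost.
    return sorted(replicas, key=lambda r: access_cost(r, size))[:k]


def plan_transfer(replicas: list[Replica], size: int, k: int) -> list[tuple[str, int, int]]:
    """Return (site, offset, length) parts that together cover the object.

    Assumption: each chosen replica serves a share proportional to its
    bandwidth; the last replica takes whatever remains after rounding.
    """
    chosen = select_replicas(replicas, size, k)
    total_bw = sum(r.bandwidth for r in chosen)
    plan = []
    offset = 0
    for i, r in enumerate(chosen):
        if i == len(chosen) - 1:
            length = size - offset
        else:
            length = int(size * r.bandwidth / total_bw)
        plan.append((r.site, offset, length))
        offset += length
    return plan


if __name__ == "__main__":
    sites = [Replica("a", 100.0, 0.0), Replica("b", 300.0, 0.0), Replica("c", 10.0, 0.0)]
    for part in plan_transfer(sites, 1000, k=2):
        print(part)
```

In this sketch the slowest site is never contacted, and the two selected sites serve disjoint byte ranges that could be fetched concurrently and concatenated into the new replica. The prediction and prefetching step is omitted because the abstract gives no detail on how spatial locality is modeled.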