A prediction-based parallel replication algorithm in distributed storage system

Authors:
Yijie Wang;Xiaoming Zhang
Affiliations:
National Laboratory for Parallel and Distributed Processing, Institute of Computer, National University of Defense Technology, Changsha, China;National Laboratory for Parallel and Distributed Processing, Institute of Computer, National University of Defense Technology, Changsha, China
Venue:
GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
Year:
2005


Abstract

Data replication can reduce bandwidth consumption and access latency in distributed systems where users require remote access to large data objects. Based on the intrinsic characteristics of distributed storage systems, this paper proposes PPR, a prediction-based parallel replication algorithm. PPR uses the characteristics of spatial data to predict which data will be accessed next and prefetches that data. During replication, it selects, according to the current network state, the several replicas of a data object that have the lowest access cost; different parts of the data object are then transferred from these replicas and assembled into a new replica. Performance evaluation shows that PPR uses network bandwidth efficiently, provides high replication efficiency and substantially better access efficiency, and effectively avoids interference between different replications.
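
The replica-selection and partitioning step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how access cost is computed or how the object is divided among the selected replicas. The names `Replica`, `select_replicas` and `plan_transfer`, the parameter `k`, and the rule that each replica serves a share inversely proportional to its access cost are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Replica:
    site: str           # hypothetical site identifier
    access_cost: float  # hypothetical cost estimate from the network state; must be > 0


def select_replicas(replicas, k):
    """Return the k replicas with the lowest access cost."""
    return sorted(replicas, key=lambda r: r.access_cost)[:k]


def plan_transfer(replicas, object_size, k):
    """Split [0, object_size) into contiguous byte ranges, one per selected replica.

    Assumption (not from the paper): a replica's share of the object is
    proportional to 1 / access_cost, so cheaper replicas serve larger parts.
    """
    chosen = select_replicas(replicas, k)
    weights = [1.0 / r.access_cost for r in chosen]
    total = sum(weights)

    plan = []
    start = 0
    cumulative = 0.0
    for i, (replica, weight) in enumerate(zip(chosen, weights)):
        cumulative += weight
        # The last range always ends at object_size so rounding never drops bytes.
        if i == len(chosen) - 1:
            end = object_size
        else:
            end = int(object_size * cumulative / total)
        plan.append((replica.site, start, end))
        start = end
    return plan
```

Calling `plan_transfer(replicas, object_size, k)` returns a list of `(site, start, end)` half-open byte ranges that together cover the whole object; in a real system each range would be fetched concurrently from its site and the parts concatenated into the new replica.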