PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids

Authors:
Nazanin Saadat;Amir Masoud Rahmani
Affiliations:
-;-
Venue:
Future Generation Computer Systems
Year:
2012

Citing 14
Cited 2

Text compression

Text compression
File and Object Replication in Data Grids

Cluster Computing
Identifying Dynamic Replication Strategies for a High-Performance Data Grid

GRID '01 Proceedings of the Second International Workshop on Grid Computing
The Case for Efficient File Access Pattern Modeling

HOTOS '99 Proceedings of the The Seventh Workshop on Hot Topics in Operating Systems
The Anatomy of the Grid: Enabling Scalable Virtual Organizations

International Journal of High Performance Computing Applications
Complete and fragmented replica selection and retrieval in Data Grids

Future Generation Computer Systems
Popularity-Driven Dynamic Replica Placement in Hierarchical Data Grids

PDCAT '08 Proceedings of the 2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies
A study on performance of dynamic file replication algorithms for real-time file access in Data Grids

Future Generation Computer Systems
Dynamic replication algorithms for the multi-tier Data Grid

Future Generation Computer Systems - Special issue: Parallel computing technologies
The complexity of static data replication in data grids

Parallel Computing
FIRE: A File Reunion Based Data Replication Strategy for Data Grids

CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Dynamic replication in a data grid using a Modified BHR Region Based Algorithm

Future Generation Computer Systems
PHFS: A dynamic replication method, to decrease access latency in the multi-tier data grid

Future Generation Computer Systems
Algorithms for automatic data replication in grid environment

PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics

Enhanced Dynamic Hierarchical Replication and Weighted Scheduling Strategy in Data Grid

Journal of Parallel and Distributed Computing
Job scheduling and dynamic data replication in data grid environment

The Journal of Supercomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, grid technology has had such a fast growth that it has been used in many scientific experiments and research centers. A large number of storage elements and computational resources are combined to generate a grid which gives us shared access to extra computing power. In particular, data grid deals with data intensive applications and provides intensive resources across widely distributed communities. Data replication is an efficient way for distributing replicas among the data grids, making it possible to access similar data in different locations of the data grid. Replication reduces data access time and improves the performance of the system. In this paper, we propose a new dynamic data replication algorithm named PDDRA that optimizes the traditional algorithms. Our proposed algorithm is based on an assumption: members in a VO (Virtual Organization) have similar interests in files. Based on this assumption and also file access history, PDDRA predicts future needs of grid sites and pre-fetches a sequence of files to the requester grid site, so the next time that this site needs a file, it will be locally available. This will considerably reduce access latency, response time and bandwidth consumption. PDDRA consists of three phases: storing file access patterns, requesting a file and performing replication and pre-fetching and replacement. The algorithm was tested using a grid simulator, OptorSim developed by European Data Grid projects. The simulation results show that our proposed algorithm has better performance in comparison with other algorithms in terms of job execution time, effective network usage, total number of replications, hit ratio and percentage of storage filled.