Branch replication scheme: A new model for data replication in large scale data grids

Authors:
José M. Pérez;Félix García-Carballeira;Jesús Carretero;Alejandro Calderón;Javier Fernández
Affiliations:
Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganes, Madrid, Spain;Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganes, Madrid, Spain;Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganes, Madrid, Spain;Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganes, Madrid, Spain;Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganes, Madrid, Spain
Venue:
Future Generation Computer Systems
Year:
2010

Citing 25
Cited 9

Lazy replication: exploiting the semantics of distributed services

PODC '90 Proceedings of the ninth annual ACM symposium on Principles of distributed computing
The dangers of replication and a solution

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
The IceCube approach to the reconciliation of divergent replicas

Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
High Performance Mass Storage and Parallel I/O: Technologies and Applications

High Performance Mass Storage and Parallel I/O: Technologies and Applications
Reliable File Transfer in Grid Environments

LCN '02 Proceedings of the 27th Annual IEEE Conference on Local Computer Networks
Giggle: a framework for constructing scalable replica location services

Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing

MSS '01 Proceedings of the Eighteenth IEEE Symposium on Mass Storage Systems and Technologies
File and Object Replication in Data Grids

HPDC '01 Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing
Performance and Scalability of a Replica Location Service

HPDC '04 Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing
A Peer-to-Peer Replica Location Service Based on a Distributed Hash Table

Proceedings of the 2004 ACM/IEEE conference on Supercomputing
File-based replica management

Future Generation Computer Systems
Adaptable Replica Consistency Service for Data Grids

ITNG '06 Proceedings of the Third International Conference on Information Technology: New Generations
Optimal Replica Placement Strategy for Hierarchical Data Grid Systems

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Replica Placement Design with Static Optimality and Dynamic Maintainability

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
The complexity of static data replication in data grids

Parallel Computing
Job scheduling and data replication on data grids

Future Generation Computer Systems
A global and parallel file system for grids

Future Generation Computer Systems - Special section: Data mining in grid computing environments
A One-Way File Replica Consistency Model in Data Grids

APSCC '07 Proceedings of the The 2nd IEEE Asia-Pacific Service Computing Conference
Globus GridFTP: what's new in 2007

Proceedings of the first international conference on Networks for grid applications
FRCS: A File Replication and Consistency Service in Data Grids

MUE '08 Proceedings of the 2008 International Conference on Multimedia and Ubiquitous Engineering
Managing Petabyte-Scale Storage for the ATLAS Tier-1 Centre at TRIUMF

HPCS '08 Proceedings of the 2008 22nd International Symposium on High Performance Computing Systems and Applications
A dynamic weighted data replication strategy in data grids

AICCSA '08 Proceedings of the 2008 IEEE/ACS International Conference on Computer Systems and Applications
A fair replica placement for parallel download on cluster grid

NBiS'07 Proceedings of the 1st international conference on Network-based information systems
Data replication techniques for data-intensive applications

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part IV
Implementation of replication methods in the grid environment

EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing

PHFS: A dynamic replication method, to decrease access latency in the multi-tier data grid

Future Generation Computer Systems
A survey of dynamic replication strategies for improving data availability in data grids

Future Generation Computer Systems
PFRF: An adaptive data replication algorithm based on star-topology data grids

Future Generation Computer Systems
Replication techniques in data grid environments

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part II
Binary vote assignment grid quorum for managing fragmented database

ICICA'12 Proceedings of the Third international conference on Information Computing and Applications
Cloud Computing: Locally Sub-Clouds instead of Globally One Cloud

International Journal of Cloud Applications and Computing
A content aware and name based routing network speed up system

ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
Job scheduling and dynamic data replication in data grid environment

The Journal of Supercomputing
Dynamic replica placement and selection strategies in data grids- A comprehensive survey

Journal of Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data replication is a practical and effective method to achieve efficient and fault-tolerant data access in grids. Traditionally, data replication schemes maintain an entire replica in each site where a file is replicated, providing a read-only model. These solutions require huge storage resources to store the whole set of replicas and do not allow efficient data modification to avoid the consistency problem. In this paper we propose a new replication method, called the Branch Replication Scheme (BRS), that provides three main advantages over traditional approaches: optimizing storage usage, by creating subreplicas; increasing data access performance, by applying parallel I/O techniques; and providing the possibility to modify the replicas, by maintaining consistency among updates in an efficient way. An analytical model of the replication scheme, naming system, and replica updating scheme are formally described in the paper. Using this model, operations such as reading, writing, or updating a replica are analyzed. Simulation results demonstrate the feasibility of BRS, as they show that the new replication algorithm increases data access performance, compared with popular replication schemes such as hierarchical and server-directed replication, which are commonly used in current data grids.