Replicated data management in the grid: the Re:GRIDiT approach

Authors:
Laura Cristiana Voicu;Heiko Schuldt;Yuri Breitbart;Hans-Jörg Schek
Affiliations:
University of Basel, Basel, Switzerland;University of Basel, Basel, Switzerland;Kent University, Kent, OH, Switzerland;ETH Zurich, Zurich, Switzerland
Venue:
Proceedings of the 1st ACM workshop on Data grids for eScience
Year:
2009

Citing 9
Cited 3

Concurrency control and recovery in database systems

Concurrency control and recovery in database systems
Unifying concurrency control and recovery of transactions with semantically rich operations

Theoretical Computer Science - Special issue: database theory
Dynamic Server Selection using Bandwidth Probing in Wide-Area Networks

Dynamic Server Selection using Bandwidth Probing in Wide-Area Networks
Grid Datafarm Architecture for Petascale Data Intensive Computing

CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
Fine-grained replication and scheduling with freshness and correctness guarantees

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Decentralized coordination of transactional processes in peer-to-peer environments

Proceedings of the 14th ACM international conference on Information and knowledge management
Web++: a system for fast and reliable web service

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
FAS: a freshness-sensitive coordination middleware for a cluster of OLAP components
DILIGENT: integrating digital library and Grid technologies for a new Earth observation research infrastructure

International Journal on Digital Libraries

How replicated data management in the cloud can benefit from a data grid protocol: the Re:GRIDiT Approach

Proceedings of the first international workshop on Cloud data management
Load-Aware Dynamic Replication Management in a Data Grid

OTM '09 Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part I
Replication techniques in data grid environments

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Grid environments more and more target novel domains such as eScience, eHealth or digital libraries that feature a variety of data-intensive applications. Consequently, issues related to data management in Grids are becoming increasingly important. In terms of data management, the Grid allows keeping a large number of replicas of data objects, possibly with different versions or levels of freshness, to allow for a high degree of availability, reliability and performance so as to best meet the needs of users and applications. At the same time, the seamless integration of replication management into the Grid while taking into account its special characteristics, needs to be done without any central component for managing data or metadata. In this paper, we report on the ongoing Re:GRIDiT project which aims at addressing all the above requirements. Re:GRIDiT distinguishes between potentially many updateable and read-only replicas which can be distributed across a Grid environment. First, Re:GRIDiT provides new protocols for the correct synchronization of concurrent updates to different updateable replicas and their subsequent propagation in a completely distributed way. Second, Re:GRIDiT takes into account the semantics of the data which is managed in the Grid: mutable data can be subject to updates; immutable data, in turn, cannot be changed once created, but may be subject to version control. Third, Re:GRIDiT will be dynamic in a way that according to the current load, new replicas (updateable or read-only) can be created or removed on demand. Fourth, Re:GRIDiT will provide read-only transactions the full flexibility to specify the freshness (for mutable data) or version number (for immutable data) -- which is particularly useful in order to trade accuracy for performance in the access to data in the Grid.