This paper addresses the challenge of transparent data sharing within computing Grids built as cluster federations. On such platforms, the availability of storage resources may change dynamically, often because of hardware failures. We focus on the problem of keeping replicated data consistent in the presence of failures. We propose a software architecture that decouples consistency management from fault-tolerance management. We illustrate this architecture with a case study showing how to design a consistency protocol using fault-tolerant building blocks. As a proof of concept, we describe a prototype implementation of this protocol within JUXMEM, an experimental software platform for Grid data sharing, and we report on a preliminary experimental evaluation of the proposed approach. Copyright © 2006 John Wiley & Sons, Ltd.