PVM: a framework for parallel distributed computing
Concurrency: Practice and Experience
Distributed snapshots: determining global states of distributed systems
ACM Transactions on Computer Systems (TOCS)
Application level fault tolerance in heterogeneous networks of workstations
Journal of Parallel and Distributed Computing
Quasi-asynchronous migration: a novel migration protocol for PVM tasks
ACM SIGOPS Operating Systems Review
Using MPI (2nd ed.): portable parallel programming with the message-passing interface
Using MPI (2nd ed.): portable parallel programming with the message-passing interface
GRAPNEL to C translation in the GRADE environment
Parallel program development for cluster computing
DynamicPVM - Dynamic Load Balancing on Parallel Systems
HPCN Europe 1994 Proceedings of the nternational Conference and Exhibition on High-Performance Computing and Networking Volume II: Networking and Tools
IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
MPI-2: Extending the Message-Passing Interface
Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing - Volume I
Fail-Safe PVM: A Portable Package for Distributed Programming with Transparent Recovery
Fail-Safe PVM: A Portable Package for Distributed Programming with Transparent Recovery
MPVM: A Migration Transparent Version of PVM
MPVM: A Migration Transparent Version of PVM
Libckpt: Transparent Checkpointing under Unix
Libckpt: Transparent Checkpointing under Unix
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
A policy-based approach for strong mobility of composed Web services
Service Oriented Computing and Applications
Hi-index | 0.00 |
This paper introduces a novel approach in parallel checkpointing aimed at supporting fault-tolerance and migration among clusters of a ClusterGrid environment with various middleware components. Based on an architectural analysis, compatibility and integrity requirements are identified and corresponding conditions are established. Some of the available checkpointing systems are checked against the conditions in order to examine their conformity. Finally, a novel checkpointing approach is defined and the Parallel Grid Runtime and Application Development Environment (P-GRADE) Grid Programming Tool is adapted.