This article presents JACEP2P-V2, a Java environment for designing parallel asynchronous iterative algorithms (with direct communication between nodes) and executing them on global computing architectures or distributed clusters composed of a large number of volatile, heterogeneous, distant computing nodes. The platform is fault tolerant, multi-threaded, and completely decentralized. We describe the components of JACEP2P-V2 and the mechanisms it uses for scalability and fault tolerance. The performance of this improved platform is evaluated in experiments that compare it to JACEP2P while solving a system of 3D advection-diffusion equations over a volatile distributed architecture. We also test the scalability of JACEP2P-V2 and its suitability for different kinds of problems by solving a large instance of the 3D advection-diffusion problem on more than 1000 cores and by solving the conjugate gradient (CG) kernel of the NAS Parallel Benchmarks.