Efficient and flexible fault tolerance and migration of scientific simulations using CUMULVS
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Virtual-machine-based heterogeneous checkpointing
Software—Practice & Experience
IEEE Intelligent Systems
Debugging Large-Scale, Long-Running Parallel Programs
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Virtual Machine Based Heterogeneous Checkpointing
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Shortcut Replay: A Replay Technique for Debugging Long-Running Parallel Programs
ASIAN '02 Proceedings of the7th Asian Computing Science Conference on Advances in Computing Science: Internet Computing and Modeling, Grid Computing, Peer-to-Peer Computing, and Cluster
System Checkpointing Using Reflection and Program Analysis
REFLECTION '01 Proceedings of the Third International Conference on Metalevel Architectures and Separation of Crosscutting Concerns
Error detection in large-scale parallel programs with long runtimes
Future Generation Computer Systems - Tools for program development and analysis
Quantifying rollback propagation in distributed checkpointing
Journal of Parallel and Distributed Computing
Practical dynamic software updating for C
Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Log-based rollback recovery without checkpoints of shared memory in software DSM
The Journal of Supercomputing
In-network fault tolerance in networked sensor systems
DIWANS '06 Proceedings of the 2006 workshop on Dependability issues in wireless ad hoc networks and sensor networks
Kernel support for zero-loss Internet service restart
Software—Practice & Experience
Model-based performance evaluation of distributed checkpointing protocols
Performance Evaluation
Transparent checkpoint-restart of multiple processes on commodity operating systems
ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
Research on Dynamic Updating of Grid Service
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Distributed Implementation of OpenMP Based on Checkpointing Aided Parallel Execution
IWOMP '07 Proceedings of the 3rd international workshop on OpenMP: A Practical Programming Model for the Multi-Core Era
A Checkpointing Method with Small Checkpoint Latency
IEICE - Transactions on Information and Systems
Algorithm 897: VTDIRECT95: Serial and parallel codes for the global optimization algorithm direct
ACM Transactions on Mathematical Software (TOMS)
VECPAR'02 Proceedings of the 5th international conference on High performance computing for computational science
Record and transplay: partial checkpointing for replay debugging across heterogeneous systems
Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Record and transplay: partial checkpointing for replay debugging across heterogeneous systems
ACM SIGMETRICS Performance Evaluation Review - Performance evaluation review
Toward a distributed implementation of openMP using CAPE
PaCT'07 Proceedings of the 9th international conference on Parallel Computing Technologies
Checkpointing aided parallel execution model and analysis
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Future Generation Computer Systems
Hi-index | 0.00 |