On the optimum checkpoint selection problem
SIAM Journal on Computing
Computing Optimal Checkpointing Strategies for Rollback and Recovery Systems
IEEE Transactions on Computers - Fault-Tolerant Computing
Optimum checkpoints with age dependent failures
Acta Informatica
Comparative Analysis of Different Models of Checkpointing and Recovery
IEEE Transactions on Software Engineering
On the Optimal Checkpointing of Critical Tasks and Transaction-Oriented Systems
IEEE Transactions on Software Engineering
Optimal checkpointing policies using the checkpointing density
Journal of Information Processing
An On-Line Algorithm for Checkpoint Placement
IEEE Transactions on Computers
Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme
IEEE Transactions on Computers
On the Optimum Checkpoint Interval
Journal of the ACM (JACM)
Performance of rollback recovery systems under intermittent failures
Communications of the ACM
A first order approximation to the optimum checkpoint interval
Communications of the ACM
A Variational Calculus Approach to Optimal Checkpoint Placement
IEEE Transactions on Computers
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Stochastic Models for Performance Analysis of Database Recovery Control
IEEE Transactions on Computers
Optimal Checkpointing and Rollback Strategies with Media Failures: Statistical Estimation Algorithms
PRDC '99 Proceedings of the 1999 Pacific Rim International Symposium on Dependable Computing
Availability Models with Age-Dependent Checkpointing
SRDS '02 Proceedings of the 21st IEEE Symposium on Reliable Distributed Systems
A Dynamic Checkpointing Scheme Based on Reinforcement Learning
PRDC '04 Proceedings of the 10th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC'04)
Min-Max Checkpoint Placement under Incomplete Failure Information
DSN '04 Proceedings of the 2004 International Conference on Dependable Systems and Networks
Proactive management of software aging
IBM Journal of Research and Development
Numerical computation algorithms for sequential checkpoint placement
Performance Evaluation
Proceedings of the 2009 workshop on Resiliency in high performance
Analysis of a software system with rejuvenation, restoration and checkpointing
ISAS'08 Proceedings of the 5th international conference on Service availability
Journal of Systems and Software
Checkpointing strategies for parallel jobs
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Environmental diversity techniques of software systems
FGIT'11 Proceedings of the Third international conference on Future Generation Information Technology
Checkpoint scheduling model for optimality
Information Processing Letters
Hi-index | 0.00 |
In this paper, we consider two kinds of sequential checkpoint placement problems with infinite/finite time horizon. For these problems, we apply approximation methods based on the variational principle and develop computation algorithms to derive the optimal checkpoint sequence approximately. Next, we focus on the situation where the knowledge on system failure is incomplete, i.e., the system failure time distribution is unknown. We develop the so-called min-max checkpoint placement methods to determine the optimal checkpoint sequence under an uncertain circumstance in terms of the system failure time distribution. In numerical examples, we investigate quantitatively the proposed distribution-free checkpoint placement methods, and refer to their potential applicability in practice.