A numerical approach for computing optimal dynamic checkpointing strategies for general rollback and recovery systems is presented. The system is modeled as a Markov renewal decision process. General failure distributions, random checkpointing durations, and reprocessing-dependent recovery times are allowed. The aim is to find a dynamic decision rule to maximize the average system availability over an infinite time horizon. A computational approach to approximate such a rule is proposed. This approach is based on value-iteration stochastic dynamic programming with spline or finite-element approximation of the value and policy functions. Numerical illustrations are provided.
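To make the computational idea concrete, the following is a minimal sketch of value iteration with a piecewise-linear (simplest finite-element) approximation of the value function. It is not the paper's model or algorithm: it assumes exponential failures, a fixed checkpoint duration, a discounted objective in place of long-run average availability, and a fixed decision step. All parameter names and values (`LAM`, `C`, `R0`, `R1`, `BETA`, `DT`, `X_MAX`) are hypothetical and chosen only for illustration. The state `x` is the amount of work done since the last checkpoint.

```python
import numpy as np

# Hypothetical parameters (assumptions, not taken from the paper).
LAM = 0.1           # failure rate; exponential failures are an assumption
C = 0.5             # checkpoint duration; fixed here, random in the paper
R0, R1 = 0.2, 0.5   # recovery time R0 + R1 * x grows with the reprocessing x
BETA = 0.05         # discount rate; the paper maximizes average availability
DT = 0.1            # length of one decision step
X_MAX = 20.0        # largest amount of uncheckpointed work represented

# Nodes of the piecewise-linear approximation of the value function.
grid = np.linspace(0.0, X_MAX, 81)


def bellman(V):
    """Apply one Bellman update; return new values and the greedy policy."""
    p_fail = 1.0 - np.exp(-LAM * DT)
    v0 = V[0]  # value right after a checkpoint or a recovery (x = 0)

    # Working without failure: earn DT of work and move to x + DT.
    # np.interp evaluates V between nodes and clamps beyond X_MAX.
    v_next = np.interp(grid + DT, grid, V)
    survive = DT + np.exp(-BETA * DT) * v_next

    # Working with failure: lose the x units of uncheckpointed work,
    # spend R0 + R1 * x on recovery, then restart from x = 0.
    fail = -grid + np.exp(-BETA * (DT + R0 + R1 * grid)) * v0

    work = (1.0 - p_fail) * survive + p_fail * fail
    ckpt = np.full_like(grid, np.exp(-BETA * C) * v0)

    policy = (ckpt > work).astype(int)  # 0 = keep working, 1 = checkpoint
    return np.maximum(work, ckpt), policy


def value_iteration(tol=1e-8, max_iter=100_000):
    """Iterate the Bellman update until the sup-norm change is below tol."""
    V = np.zeros_like(grid)
    for _ in range(max_iter):
        V_new, policy = bellman(V)
        residual = float(np.max(np.abs(V_new - V)))
        V = V_new
        if residual < tol:
            break
    return V, policy, residual


V, policy, residual = value_iteration()
# Smallest grid point at which checkpointing is preferred to working.
threshold = float(grid[np.argmax(policy == 1)])
print(f"checkpoint once uncheckpointed work reaches about {threshold:.2f}")
```

The decision step `DT` deliberately does not coincide with the node spacing, so each update has to evaluate the value function between nodes. That is the role interpolation plays in such schemes; a cubic spline could replace `np.interp` at the cost of a less simple convergence argument.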