The probability distribution of the overhead caused by the checkpointing rollback recovery technique is evaluated for two cases: a single critical task and an overall transaction-oriented system. The distribution is obtained in Laplace-Stieltjes transform form, from which all moments can be easily calculated; alternatively, inversion methods can be used to evaluate the distribution itself. Based on this distribution, the authors propose checkpointing strategies that optimize performance criteria motivated, for critical tasks, by real-time constraints and, for transaction-oriented systems, by the need to give users a guarantee on the maximum system unavailability.
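As an illustration of how moments follow from a Laplace-Stieltjes transform, recall that for a nonnegative random variable X with transform phi(s) = E[exp(-sX)], the mean is -phi'(0) and the second moment is phi''(0). The sketch below is not the authors' overhead model: it assumes a hypothetical exponentially distributed overhead (the function `lst_exponential` and the rate value are placeholders chosen only for illustration) and approximates the derivatives at s = 0 with central finite differences.

```python
def lst_exponential(s, rate=2.0):
    """LST of an exponential distribution: E[exp(-s X)] = rate / (rate + s).

    Hypothetical stand-in for an overhead transform; not taken from the paper.
    """
    return rate / (rate + s)


def moments_from_lst(lst, h=1e-4):
    """Approximate the first two moments from an LST via central differences.

    Uses E[X] = -phi'(0) and E[X^2] = phi''(0). The transform must be
    defined on a small interval around s = 0 (here, on [-h, h]).
    """
    first_derivative = (lst(h) - lst(-h)) / (2.0 * h)
    second_derivative = (lst(h) - 2.0 * lst(0.0) + lst(-h)) / (h * h)
    mean = -first_derivative
    second_moment = second_derivative
    return mean, second_moment


mean, second_moment = moments_from_lst(lst_exponential)
variance = second_moment - mean ** 2
print(f"mean ~ {mean:.6f}, second moment ~ {second_moment:.6f}, variance ~ {variance:.6f}")
```

For the assumed rate of 2, the exact values are 1/rate = 0.5 for the mean and 2/rate^2 = 0.5 for the second moment, so the printed approximations should be close to these. Finite differences are used here only to keep the sketch dependency-free; symbolic differentiation or a numerical inversion routine would be the natural choice when the full distribution is needed.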