On the optimum checkpoint selection problem
SIAM Journal on Computing
Computing Optimal Checkpointing Strategies for Rollback and Recovery Systems
IEEE Transactions on Computers - Fault-Tolerant Computing
Optimum checkpoints with age dependent failures
Acta Informatica
Comparative Analysis of Different Models of Checkpointing and Recovery
IEEE Transactions on Software Engineering
On the Optimal Checkpointing of Critical Tasks and Transaction-Oriented Systems
IEEE Transactions on Software Engineering
Optimal checkpointing policies using the checkpointing density
Journal of Information Processing
Minimizing completion time of a program by checkpointing and rejuvenation
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Optimal software rejuvenation for tolerating soft failures
Performance Evaluation
An On-Line Algorithm for Checkpoint Placement
IEEE Transactions on Computers
Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme
IEEE Transactions on Computers
Analysis of Preventive Maintenance in Transactions Based Software Systems
IEEE Transactions on Computers
On-board preventive maintenance: a design-oriented analytic study for long-life applications
IPDS '98 Proceedings of the third IEEE international performance and dependability symposium on International performance and dependability symposium
On the Optimum Checkpoint Interval
Journal of the ACM (JACM)
Performance of rollback recovery systems under intermittent failures
Communications of the ACM
A first order approximation to the optimum checkpoint interval
Communications of the ACM
Analysis and implementation of software rejuvenation in cluster systems
Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
A Variational Calculus Approach to Optimal Checkpoint Placement
IEEE Transactions on Computers
Fine grained software degradation models for optimal rejuvenation policies
Performance Evaluation
Monitoring Smoothly Degrading Systems for Increased Dependability
Empirical Software Engineering
Stochastic Models for Performance Analysis of Database Recovery Control
IEEE Transactions on Computers
PNPM '99 Proceedings of the The 8th International Workshop on Petri Nets and Performance Models
Availability Models with Age-Dependent Checkpointing
SRDS '02 Proceedings of the 21st IEEE Symposium on Reliable Distributed Systems
Dependability Analysis of a Client/Server Software System with Rejuvenation
ISSRE '02 Proceedings of the 13th International Symposium on Software Reliability Engineering
Software Rejuvenation: Analysis, Module and Applications
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
A Dynamic Checkpointing Scheme Based on Reinforcement Learning
PRDC '04 Proceedings of the 10th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC'04)
A Comprehensive Model for Software Rejuvenation
IEEE Transactions on Dependable and Secure Computing
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle
IEEE Transactions on Dependable and Secure Computing
Optimal Checkpoint Placement with Equality Constraints
DASC '06 Proceedings of the 2nd IEEE International Symposium on Dependable, Autonomic and Secure Computing
Behavioral Analysis of a Fault-Tolerant Software System with Rejuvenation
IEICE - Transactions on Information and Systems
Performability analysis of clustered systems with rejuvenation under varying workload
Performance Evaluation
Analysis of Restart Mechanisms in Software Systems
IEEE Transactions on Software Engineering
ISAS '07 Proceedings of the 4th international symposium on Service Availability
Proactive management of software aging
IBM Journal of Research and Development
Optimizing preventive service of software products
IBM Journal of Research and Development
Analysis of a software system with rejuvenation, restoration and checkpointing
ISAS'08 Proceedings of the 5th international conference on Service availability
A measurement study of the interplay between application level restart and transport protocol
ISAS'04 Proceedings of the First international conference on Service Availability
Analysis of a service degradation model with preventive rejuvenation
ISAS'06 Proceedings of the Third international conference on Service Availability
Analytic models for rollback and recovery strategies in data base systems
IEEE Transactions on Software Engineering
Environmental diversity techniques of software systems
FGIT'11 Proceedings of the Third international conference on Future Generation Information Technology
Hi-index | 0.00 |
This paper examines comprehensive evaluation of aperiodic time-based checkpointing and rejuvenation schemes maximizing the steady-state system availability in an operational software system. We consider two kinds of maintenance policies: checkpointing prior to rejuvenating (CPTR) and rejuvenating prior to checkpointing (RPTC). These schemes are complementary from each other to schedule checkpoints and rejuvenation points. In addition, under a periodic full maintenance operation, we show that aperiodic checkpointing or rejuvenation scheme is optimal to maximize the steady-state system availability by applying the dynamic programming. In numerical examples, CPTR and RPTC are comparatively examined with same overhead parameters, and the effects of CPTR and RPTC on maximizing the steady-state system availability are investigated.