Queueing Analysis of Fault-Tolerant Computer Systems
IEEE Transactions on Software Engineering
A note on the effect of preemptive policies on the stability of a priority queue
Information Processing Letters
Performance Modeling Based on Real Data: A Case Study
IEEE Transactions on Computers - Fault-Tolerant Computing
Error analysis of a Laplace transform inversion procedure
SIAM Journal on Numerical Analysis
System performance in a failure prone environment
System performance in a failure prone environment
Preemptive repeat identical transitions in Markov regenerative stochastic Petri nets
PNPM '95 Proceedings of the Sixth International Workshop on Petri Nets and Performance Models
IEEE Transactions on Computers
ICSOC '07 Proceedings of the 5th international conference on Service-Oriented Computing
FTCS'95 Proceedings of the Twenty-Fifth international conference on Fault-tolerant computing
Job completion time on a virtualized server with software rejuvenation
ACM Journal on Emerging Technologies in Computing Systems (JETC) - Special Issue on Reliability and Device Degradation in Emerging Technologies and Special Issue on WoSAR 2011
Hi-index | 14.98 |
The authors describe a technique for computing the distribution of the completion time of a program on a server subject to failure and repair. Several realistic aspects of the system are included in the model. The server behavior is modeled by a semi-Markov process in order to accommodate nonexponential repair-time distributions. More importantly, the effect on the job completion time of the work lost due to the occurrence of a server failure is modeled. They derive a closed-form expression for the Laplace-Stieltjes transform (LST) of the time to completion distribution of programs on such systems. They then describe an effective numerical procedure for computing the completion time distribution. They show how these results apply to the analysis of different computer system structures and organizations of fault-tolerant systems. Finally, they use numerical solution methods to find the distribution of time to completion on several systems.