Analysis of Preventive Maintenance in Transactions Based Software Systems
IEEE Transactions on Computers
Performance of Computer Communication Systems: A Model-Based Approach
Performance of Computer Communication Systems: A Model-Based Approach
A survey of rollback-recovery protocols in message-passing systems
ACM Computing Surveys (CSUR)
Stochastic Modeling Formalisms for Dependability, Performance and Performability
Performance Evaluation: Origins and Directions
Petri Net Modelling and Performability Evaluation with TimeNET 3.0
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Software Rejuvenation: Analysis, Module and Applications
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
A Generic Availability Model for Clustered Computing Systems
PRDC '01 Proceedings of the 2001 Pacific Rim International Symposium on Dependable Computing
Distributed Simulation of Colored Stochastic Petri Nets With TimeNET 4.0
QEST '06 Proceedings of the 3rd international conference on the Quantitative Evaluation of Systems
Service Availability of Systems with Failure Prevention
APSCC '08 Proceedings of the 2008 IEEE Asia-Pacific Services Computing Conference
Modeling user-perceived service availability
ISAS'05 Proceedings of the Second international conference on Service Availability
Towards IT systems capable of managing their health
FOCS'10 Proceedings of the 16th Monterey conference on Foundations of computer software: modeling, development, and verification of adaptive systems
Hi-index | 0.00 |
In this paper we present and analyse a coloured stochastic Petri net model of a redundant fault-tolerant system. As our measure of interest we are interested in a dependability metric, i.e., service availability. Service availability is defined as the number of successfully completed jobs relative to the total number of arrived jobs. This paper is the first step towards a comprehensive comparison of redundancy and rejuvenation, i.e., the preventive restart of servers when studying service availability. The question we strive to answer in this paper is whether and to what degree additional redundant servers can increase service availability in all load scenarios. We find that the first redundant server improves service availability by almost 90% in a highly loaded system, while adding a second and third redundant server yields further but much lower improvement. Under low load the benefit of additional servers is not as pronounced.