A Petri net model for service availability in redundant computing systems

Authors:
Felix Salfner;Katinka Wolter
Affiliations:
Humboldt-Universität zu Berlin, Unter den Linden, Berlin, Germany;Freie Universität Berlin, Takustr, Berlin, Germany
Venue:
Winter Simulation Conference
Year:
2009

Citing 10
Cited 1

Analysis of Preventive Maintenance in Transactions Based Software Systems

IEEE Transactions on Computers
Performance of Computer Communication Systems: A Model-Based Approach

Performance of Computer Communication Systems: A Model-Based Approach
A survey of rollback-recovery protocols in message-passing systems

ACM Computing Surveys (CSUR)
Stochastic Modeling Formalisms for Dependability, Performance and Performability

Performance Evaluation: Origins and Directions
Petri Net Modelling and Performability Evaluation with TimeNET 3.0

TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Software Rejuvenation: Analysis, Module and Applications

FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
A Generic Availability Model for Clustered Computing Systems

PRDC '01 Proceedings of the 2001 Pacific Rim International Symposium on Dependable Computing
Distributed Simulation of Colored Stochastic Petri Nets With TimeNET 4.0

QEST '06 Proceedings of the 3rd international conference on the Quantitative Evaluation of Systems
Service Availability of Systems with Failure Prevention

APSCC '08 Proceedings of the 2008 IEEE Asia-Pacific Services Computing Conference
Modeling user-perceived service availability

ISAS'05 Proceedings of the Second international conference on Service Availability

Towards IT systems capable of managing their health

FOCS'10 Proceedings of the 16th Monterey conference on Foundations of computer software: modeling, development, and verification of adaptive systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present and analyse a coloured stochastic Petri net model of a redundant fault-tolerant system. As our measure of interest we are interested in a dependability metric, i.e., service availability. Service availability is defined as the number of successfully completed jobs relative to the total number of arrived jobs. This paper is the first step towards a comprehensive comparison of redundancy and rejuvenation, i.e., the preventive restart of servers when studying service availability. The question we strive to answer in this paper is whether and to what degree additional redundant servers can increase service availability in all load scenarios. We find that the first redundant server improves service availability by almost 90% in a highly loaded system, while adding a second and third redundant server yields further but much lower improvement. Under low load the benefit of additional servers is not as pronounced.