Segregated failures model for availability evaluation of fault-tolerant systems
ACSC '06 Proceedings of the 29th Australasian Computer Science Conference - Volume 48
Using several distinct recovery procedures is one technique for ensuring high availability and fault tolerance in computer systems. The method has been applied to telecommunications systems and usually relies on redundant hardware and special recovery software to restore the system after hardware and software failures. We propose a simple, practical analytical approach to availability evaluation of systems with several recovery procedures, based on a new "segregated failures" model. To illustrate the method, we apply it to the availability evaluation of a Lucent Technologies Reliable Clustered Computing application. Detailed numerical results are provided, and the impact of various types of failures and coverage factors on downtime is analysed.
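As a rough illustration of how failure types and coverage factors can feed into a downtime estimate, the Python sketch below uses a generic first-order approximation. It is not the segregated failures model itself, which the abstract does not specify. It assumes each failure class has a yearly failure rate, a coverage factor (the probability that the fast recovery procedure succeeds), and fixed durations for the fast and the fallback recovery. The function names, the class names, and all numbers are hypothetical.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525600, ignoring leap years


def class_downtime_minutes(failures_per_year, coverage, fast_min, slow_min):
    """Expected yearly downtime (minutes) contributed by one failure class.

    Assumption: with probability `coverage` the fast recovery procedure
    succeeds and costs `fast_min`; otherwise the system falls back to a
    slower procedure that costs `slow_min`.
    """
    if not 0.0 <= coverage <= 1.0:
        raise ValueError("coverage must be between 0 and 1")
    mean_recovery = coverage * fast_min + (1.0 - coverage) * slow_min
    return failures_per_year * mean_recovery


def availability(total_downtime_minutes):
    """Steady-state availability implied by a yearly downtime total."""
    return 1.0 - total_downtime_minutes / MINUTES_PER_YEAR


# Hypothetical failure classes, for illustration only:
# (failures per year, coverage, fast recovery minutes, slow recovery minutes)
FAILURE_CLASSES = {
    "hardware": (2.0, 0.90, 1.0, 30.0),
    "software": (10.0, 0.95, 0.5, 20.0),
}


def total_downtime_minutes(classes):
    """Sum the downtime contributions of all failure classes."""
    return sum(class_downtime_minutes(*params) for params in classes.values())


if __name__ == "__main__":
    for name, params in FAILURE_CLASSES.items():
        print(name, round(class_downtime_minutes(*params), 2), "min/year")
    total = total_downtime_minutes(FAILURE_CLASSES)
    print("availability:", availability(total))
```

Keeping the contribution of each failure class separate makes it easy to see which class, and which coverage factor, dominates the downtime total. That is the kind of sensitivity question the abstract says the numerical results address.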