The Concept of Coverage and Its Effect on the Reliability Model of a Repairable System
IEEE Transactions on Computers
A Reliability Model for Gracefully Degrading and Standby-Sparing Systems
IEEE Transactions on Computers
Reliability Modeling for Fault-Tolerant Computers
IEEE Transactions on Computers
Some relationships between failure detection probability and computer system reliability
AFIPS '67 (Fall) Proceedings of the November 14-16, 1967, fall joint computer conference
AFIPS '70 (Spring) Proceedings of the May 5-7, 1970, spring joint computer conference
Recovery through programming system/360: system/370
AFIPS '71 (Spring) Proceedings of the May 18-20, 1971, spring joint computer conference
AFIPS '72 (Fall, part I) Proceedings of the December 5-7, 1972, fall joint computer conference, part I
Error Detection Process Model, Design, and Its Impact on Computer Performance
IEEE Transactions on Computers
IEEE Transactions on Computers
A Simplified Method to Calculate Failure Times in Fault-Tolerant Systems
IEEE Transactions on Computers
Automatic Generation of Symbolic Reliability Functions for Processor-Memory-Switch Structures
IEEE Transactions on Computers - Lecture notes in computer science Vol. 174
Higher reliability redundant disk arrays: Organization, operation, and coding
ACM Transactions on Storage (TOS)
Rebuild processing in RAID5 with emphasis on the supplementary parity augmentation method[37]
ACM SIGARCH Computer Architecture News
Hi-index | 14.99 |
The diversified nature of fault-tolerant computers led to the development of a multiplicity of reliability models which are seemingly unrelated to each other. As a result, it becomes difficult to develop automated tools for reliability analysis which are both general and efficient. Thus, the potential of reliability modeling as a practical and useful tool in the design process of fault-tolerant computers has not been fully realized. This paper summarizes the results of an extended effort to develop a unified approach to reliability modeling of fault-tolerant computers which strikes a good compromise between generality and practicality. The unified model developed encompasses repairable and nonrepairable systems and models, transient as well as permanent faults, and their recovery. Based on the unified model, a powerful and efficient reliability estimation program ARIES has been developed.