Reliability modeling techniques for self-repairing computer systems
ACM '69 Proceedings of the 1969 24th national conference
Switch Complexity in Systems with Hybrid Redundancy
IEEE Transactions on Computers
An Iterative Cell Switch Design for Hybrid Redundancy
IEEE Transactions on Computers
IEEE Transactions on Computers
Fault-Tolerance of the Iterative Cell Array Switch for Hybrid Redundancy
IEEE Transactions on Computers
An Algorithm for the Accurate Reliability Evaluation of Triple Modular Redundancy Networks
IEEE Transactions on Computers
On Reliability Modeling and Analysis of Ultrareliable Fault-Tolerant Digital Systems
IEEE Transactions on Computers
Reliability Modeling for Fault-Tolerant Computers
IEEE Transactions on Computers
Design of fault-tolerant computers
AFIPS '67 (Fall) Proceedings of the November 14-16, 1967, fall joint computer conference
AFIPS '70 (Spring) Proceedings of the May 5-7, 1970, spring joint computer conference
A framework for hardware-software tradeoffs in the design of fault-tolerant computers
AFIPS '72 (Fall, part I) Proceedings of the December 5-7, 1972, fall joint computer conference, part I
The use of triple-modular redundancy to improve computer reliability
IBM Journal of Research and Development
On-line Testing and Recovery in TMR Systems for Real-Time Applications
ITC '01 Proceedings of the 2001 IEEE International Test Conference
A Hardware Redundancy Reconfiguration Scheme for Tolerating Multiple Module Failures
IEEE Transactions on Computers
Performance-Related Reliability Measures for Computing Systems
IEEE Transactions on Computers
Fault-Tolerant Computing: A Introduction
IEEE Transactions on Computers
IEEE Transactions on Computers
Reliability Measure of Hardware Redundancy Fault-Tolerant Digital Systems with Intermittent Faults
IEEE Transactions on Computers
An Algebraic Model of Fault-Masking Logic Circuits
IEEE Transactions on Computers
An overview of fault-tolerant digital system architecture
AFIPS '77 Proceedings of the June 13-16, 1977, national computer conference
Modular redundancy without voters decreases complexity of restoring organ
AFIPS '77 Proceedings of the June 13-16, 1977, national computer conference
System-on-Chip Test Architectures: Nanometer Design for Testability
System-on-Chip Test Architectures: Nanometer Design for Testability
Designs for dlagnosablllty and reliability in VLSI systems
ITC'88 Proceedings of the 1988 international conference on Test: new frontiers in testing
Hi-index | 15.00 |
The goals of this paper are to present an efficient redundancy scheme for highly reliable systems, to give a method to compute the exact reliability of such systems and to compare this scheme with other redundancy schemes. This redundancy scheme is self-purging redundancy, a scheme that uses a threshold voter and that purges the failed modules. Switches for self-purging systems are extremely simple: there is no replacement of the failed modules and module purging is quite simply implemented. Because of switch simplicity, exact reliability calculations re possible. The effects of switch reliability are quantitatively examined. For short mission times, switch reliability is the most important factor: self-purging systems have a probability of failure several times larger than the figure obtained when switches are assumed to be perfect. The influence of the relative frequency of the diverse types of failures (permanent, intermittent, stuck-at, multiple,...) is also investigated. Reliability functions, mission time improvements, and switch efficiency are computed and displayed. Self-purging systems are compared with other redundant systems, like hybrid or NMR, for their relative merits in reliability gain, simplicity, cost, and confidence in the reliability estimation. The high confidence in the reliability evaluation of self-purging systems makes them a standard for the validation of several models that have been proposed to take into account switch reliability. The accuracy of the models using coverage factors can be evaluated in this way.