Reliability modeling techniques for self-repairing computer systems
ACM '69 Proceedings of the 1969 24th national conference
Fault-tolerance and fault-intolerance: Complementary approaches to reliable computing
Proceedings of the international conference on Reliable software
The Architectural Elements of a Symmetric Fault-Tolerant Multiprocessor
IEEE Transactions on Computers
The MECRA: A Self-Reconfigurable Computer for Highly Reliable Process
IEEE Transactions on Computers
Arithmetic Error Codes: Cost and Effectiveness Studies for Application in Digital System Design
IEEE Transactions on Computers
Modeling and Digital Simulation for Design Verification and Diagnosis
IEEE Transactions on Computers
SAGE: a data-processing system for air defense
IRE-ACM-AIEE '57 (Eastern) Papers and discussions presented at the December 9-13, 1957, eastern joint computer conference: Computers with deadlines to meet
Multics: the first seven years
AFIPS '72 (Spring) Proceedings of the May 16-18, 1972, spring joint computer conference
SIFT: software implemented fault tolerance
AFIPS '72 (Fall, part I) Proceedings of the December 5-7, 1972, fall joint computer conference, part I
Pluribus: a reliable multiprocessor
AFIPS '75 Proceedings of the May 19-22, 1975, national computer conference and exposition
A study of fault tolerance techniques for associative processors
AFIPS '74 Proceedings of the May 6-10, 1974, national computer conference and exposition
Design of serviceability features for the IBM system/360
IBM Journal of Research and Development
IBM Systems Journal
Development of on-board space computer systems
IBM Journal of Research and Development
Adaptive Application-Centric Management in Meta-computing Environments
EurAsia-ICT '02 Proceedings of the First EurAsian Conference on Information and Communication Technology
A concept for test and reconfiguration of a fault-tolerant VLSI processor system
ISCA '80 Proceedings of the 7th annual symposium on Computer Architecture
Towards a Control-Theoretical Approach to Software Fault-Tolerance
QSIC '04 Proceedings of the Quality Software, Fourth International Conference
Performance study of Byzantine Agreement Protocol with artificial neural network
Information Sciences: an International Journal
Acceptable Testing of VLSI Components Which Contain Error Correctors
IEEE Transactions on Computers
A Diagnosis Algorithm for Distributed Computing Systems with Dynamic Failure and Repair
IEEE Transactions on Computers
Synchronization and Matching in Redundant Systems
IEEE Transactions on Computers
Achieving software robustness via large-scale multiagent systems
Software engineering for large-scale multi-agent systems
Software health management with Bayesian networks
Innovations in Systems and Software Engineering
Hi-index | 14.99 |
Basic concepts, motivation, and techniques of fault tolerance are discussed in this paper. The topics include fault classification, redundancy techniques, reliability modeling and prediction, examples of fault-tolerant computers, and some approaches to the problem of tolerating design faults.