This paper critically surveys methods for the automated probabilistic diagnosis of large multiprocessor systems. In recent years, much of the work on system-level diagnosis has focused on probabilistic methods, which can diagnose intermittently faulty processing nodes and can be applied in general situations on general interconnection networks. The theory behind the probabilistic diagnosis methods is explained, and the various diagnosis algorithms are described in simple terms with the aid of a running example. The methods are then compared and analyzed, and a chart summarizes the comparative advantages of the algorithms with respect to several factors important to probabilistic diagnosis.