The problem of fault diagnosis in multiprocessor systems is considered under a probabilistic fault model. The focus is on minimizing the number of tests that must be conducted to correctly diagnose the state of every processor in the system with high probability. A diagnosis algorithm is presented that, for a class of systems performing slightly more than a linear number of tests, correctly diagnoses these states with probability approaching one. A nearly matching lower bound on the number of tests required to achieve correct diagnosis in arbitrary systems is proved. Lower and upper bounds on the number of tests required for regular systems are also presented. A class of regular systems that includes hypercubes is shown to be correctly diagnosable with high probability. In all cases, the number of tests required under this probabilistic model is shown to be significantly smaller than under a bounded-size fault set model. These results represent a substantial improvement in the performance of system-level diagnosis techniques.
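To make the setting concrete, the following is a minimal simulation sketch of diagnosis under a probabilistic fault model. It is not the algorithm from the paper. It assumes that each processor fails independently with probability p, that each processor is tested by its k predecessors on a ring, that a faulty tester returns a coin-flip result, and that a processor is declared faulty when a strict majority of its testers say so. The function name and all parameters are hypothetical and chosen only for illustration.

```python
import random


def simulate_majority_diagnosis(n, k, p, seed=0):
    """Run one random trial; return (true_faulty, diagnosed_faulty, tests_used)."""
    rng = random.Random(seed)
    # Assumption: each processor is faulty independently with probability p.
    faulty = [rng.random() < p for _ in range(n)]
    diagnosed = []
    for u in range(n):
        votes_faulty = 0
        # Assumption: processor u is tested by its k predecessors on a ring.
        for d in range(1, k + 1):
            tester = (u - d) % n
            if faulty[tester]:
                # Assumption: a faulty tester's result is a fair coin flip.
                result = rng.random() < 0.5
            else:
                # A fault-free tester reports the true state of u.
                result = faulty[u]
            votes_faulty += result
        # Declare u faulty when a strict majority of its testers say so.
        diagnosed.append(votes_faulty * 2 > k)
    return faulty, diagnosed, n * k


if __name__ == "__main__":
    truth, guess, tests = simulate_majority_diagnosis(n=1000, k=15, p=0.05, seed=1)
    errors = sum(t != g for t, g in zip(truth, guess))
    print(f"tests used: {tests}, misdiagnosed processors: {errors}")
```

The sketch uses n * k tests in total, so the test count is controlled by how k is chosen relative to n. Running it for several seeds and values of k gives a rough feel for how the number of tests per processor affects the chance that every processor is diagnosed correctly.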