Reliable Fault Diagnosis with Few Tests

  • Authors:
  • Andrzej Pelc;Eli Upfal

  • Affiliations:
  • Département d'Informatique, Université du Québec à Hull, Hull, Québec J8X 3X7, Canada (e-mail: pelc@uqah.uquebec.ca);IBM Almaden Research Center, San Jose, CA 95120, USA, and Department of Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel (e-mail: eli@wisdom.weizmann.ac.il)

  • Venue:
  • Combinatorics, Probability and Computing
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the problem of fault diagnosis in multiprocessor systems. Processors perform tests on one another: fault-free testers correctly identify the fault status of tested processors, while faulty testers can give arbitrary test results. Processors fail independently with constant probability pqq-diagnosis is a fault diagnosis algorithm whose probability of error does not exceed q. We show that the minimum number of tests to perform q-diagnosis for n processors is Θ(n log 1/q) in the nonadaptive case and n+Θ( log 1/q) in the adaptive case. We also investigate q-diagnosis algorithms that minimize the maximum number of tests performed by, and performed on, processors in the system, constructing testing schemes in which each processor is involved in very few tests. Our results demonstrate that the flexibility yielded by adaptive testing permits a significant saving in the number of tests for the same reliability of diagnosis.