Tradeoffs in system level diagnosis of multiprocessor systems

Authors:
A. Kavianpour;A. D. Friedman
Affiliations:
Sharif University of Technology, Tehran, Iran;The George Washington University, Washington D.C.
Venue:
AFIPS '84 Proceedings of the July 9-12, 1984, national computer conference and exposition
Year:
1984

Citing 6
Cited 1

Characterization of Connection Assignment of Diagnosable Systems

IEEE Transactions on Computers
An Analysis Model for Digital System Diagnosis

IEEE Transactions on Computers
System Fault Diagnosis: Closure and Diagnosability with Repair

IEEE Transactions on Computers
System Fault Diagnosis: Masking, Exposure, and Diagnosability Without Repair

IEEE Transactions on Computers
A self-diagnosable computer

AFIPS '65 (Fall, part I) Proceedings of the November 30--December 1, 1965, fall joint computer conference, part I
A structural theory of machine diagnosis

AFIPS '67 (Spring) Proceedings of the April 18-20, 1967, spring joint computer conference

A linear time pessimistic one-step diagnosis algorithm for hypercube multicomputer systems

Parallel Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

The development of LSI technology makes it possible to partition a system into replaceable modules, and the advent of low-cost microprocessors makes possible networks of hundreds (or more) of interconnected modules. The problem of repairing such a system is becoming a matter of major importance in digital systems. In this paper a new procedure for defining an optimal design with respect to cost of repair for a system consisting of replaceable modules (processors) is introduced. Also the tradeoff between the number of repetitions of the diagnostic test (speed of diagnosis), the number of testing links in the system (complexity), and the number of replaced fault-free modules (accuracy) is considered. In an early paper, Preparata, Metze, and Chien4 formulated a model of system level diagnosis and defined two types of diagnosability measures, i.e. one-step t-fault diagnosability, and sequential t-fault diagnosability. They proved that Dδt is one-step t-fault diagnosable and single loop connection is sequentially t-fault diagnosable. Friedman12 later generalized this measure to one-step t-out-of-S (t/S) diagnosability, in which t faults are diagnosed to within S ≥ t modules. This introduces the possibility of inexact diagnosis---i.e. such that some fault-free modules may have to be replaced in order to repair a system in one step. So far most of the results that are available are only for single-loop or Dδt design, and the results for a system in between these two extreme cases are not available. A Dδt system needs more testing links and a single-loop system needs more steps in order to be repaired. In this paper we have defined a design in between Dδt and single-loop systems; also the tradeoff between the number of repetitions of the diagnostic test (speed of diagnosis), the number of testing links (complexity), and the number of replaced fault-free modules (accuracy) is considered, and the optimal design with respect to cost of repair is defined.