Software fault isolation in wide area networks

  • Authors:
  • Dinesh Gambhir;Ivan Frish;Micheal Post

  • Affiliations:
  • Farleigh Dickinson U. Madison, NJ;Polytechnic U. Brooklyn, NY;Bellcore, Redbank, NJ

  • Venue:
  • CSC '92 Proceedings of the 1992 ACM annual conference on Communications
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of real-time detection and isolation of errors in distributed software systems operating in a wide-area networked environment is considered. The approach presented combines the results of static software analysis with dynamic event-driven monitoring. Static software analysis is used to generate a model of the distributed system. The model describes all possible executions of the processes composing the distributed system. The event-driven monitoring algorithm upon detecting an erroneous event uses the model to isolate the distributed software process states causing the fault. Because this approach does not require the use of the network for fault isolation, it is ideal for use in the low-bandwidth, high-latency communications environments characterizing wide-area networks.