A message-based fault diagnosis procedure
SIGCOMM '86 Proceedings of the ACM SIGCOMM conference on Communications architectures & protocols
An Acyclic Expansion Algorithm for Fast Protocol Validation
IEEE Transactions on Software Engineering
A Distributed Algorithm for Fault Diagnosis in Systems with Soft Failures
IEEE Transactions on Computers
Local directed graphs
Error detection with multiple observers
Proceedings of the IFIP WG6.1 Fifth International Conference on Protocol Specification, Testing and Verification V
Hi-index | 0.00 |
The problem of real-time detection and isolation of errors in distributed software systems operating in a wide-area networked environment is considered. The approach presented combines the results of static software analysis with dynamic event-driven monitoring. Static software analysis is used to generate a model of the distributed system. The model describes all possible executions of the processes composing the distributed system. The event-driven monitoring algorithm upon detecting an erroneous event uses the model to isolate the distributed software process states causing the fault. Because this approach does not require the use of the network for fault isolation, it is ideal for use in the low-bandwidth, high-latency communications environments characterizing wide-area networks.