Automated Online Monitoring of Distributed Applications through External Monitors
IEEE Transactions on Dependable and Secure Computing
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Automated Rule-Based Diagnosis through a Distributed Monitor System
IEEE Transactions on Dependable and Secure Computing
Using status messages in the distributed test architecture
Information and Software Technology
Dependence-based multi-level tracing and replay for wireless sensor networks debugging
Proceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems
Constructing formal rules to verify message communication in distributed systems
The Journal of Supercomputing
Specification and verification of reliability in dispatching multicast messages
The Journal of Supercomputing
Hi-index | 0.00 |
This paper proposes a specification-based monitoring approach for automatic run-time detection of software errors and failures of distributed systems. The specification is assumed to be expressed in communicating finite state machines based formalism. The monitor observes the external I/O and partial state information of the target distributed system and uses them to interpret the specification. The approach is compositional as it achieves global monitoring bycombining the component-level monitoring. The core of the paper describes the architecture and operations of the monitor. The monitor includes several independent mechanisms,each tailored to detecting specific kinds of errors or failures. Their operations are described in detail using illustrative examples. Techniques for dealing with nondeterminism and concurrency issues in monitoring a distributed system are also discussed with respect to the considered model and specification. A case study describing the application of the prototype monitor to an embedded system is presented.