Simulating computer systems: techniques and tools
Simulating computer systems: techniques and tools
Implementation of Online Distributed System-Level Diagnosis Theory
IEEE Transactions on Computers - Special issue on fault-tolerant computing
Fault-tolerant computer system design
A Hierarchical Adaptive Distributed System-Level Diagnosis Algorithm
IEEE Transactions on Computers
Distributed Diagnosis in Dynamic Fault Environments
IEEE Transactions on Parallel and Distributed Systems
Efficient Fault Identification of Diagnosable Systems under the Comparison Model
IEEE Transactions on Computers
Towards Standardized and Automated Fault Management and Service Provisioning for NGNs
Journal of Network and Systems Management
HyperBone: A Scalable Overlay Network Based on a Virtual Hypercube
CCGRID '08 Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid
Strongly Diagnosable Systems under the Comparison Diagnosis Model
IEEE Transactions on Computers
Comparison-Based System-Level Fault Diagnosis: A Neural Network Approach
IEEE Transactions on Parallel and Distributed Systems
Hi-index | 0.00 |
Distributed diagnosis allows a set of fault-free nodes to monitor the state of all nodes of a given system. Diagnosis is based on the results of tests, which are assigned among system nodes. Hierarchical testing assignments lead to efficient and scalable diagnosis algorithms, both in terms of the number of tests executed, and the latency. In this work we present a novel hierarchical testing strategy, called MoDiVHA. By obtaining as much diagnostic information as possible from each test, and avoiding tests on nodes about which information has been already obtained, MoDiVHA executes less tests in comparison with previously proposed assignments. Experimental results are presented from three series of simulations conducted to compute the number of tests and diagnosis latency for various system sizes and fault situations.