Distributed Fault Localization in Hierarchically Routed Networks

  • Authors:
  • Malgorzata Steinder;Adarsh Sethi

  • Affiliations:
  • -;-

  • Venue:
  • DSOM '02 Proceedings of the 13th IFIP/IEEE International Workshop on Distributed Systems: Operations and Management: Management Technologies for E-Commerce and E-Business Applications
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Probabilistic inference was shown effective in non-deterministic diagnosis of end-to-end service failures. To overcome the exponential complexity of the exact inference algorithms in fault propagation models represented by graphs with undirected loops, Pearl's iterative algorithms for polytrees were used as an approximation schema. The approximation made it possible to diagnose end-to-end service failures in network topologies composed of tens of nodes. This paper proposes a distributed algorithm that increases the admissible network size by an order of magnitude. The algorithm divides the computational effort and system knowledge among multiple, hierarchically organized managers. The cooperation among managers is illustrated with examples, and the results of a preliminary performance study are presented.