Maintenance of Monitoring Systems Throughout Self-healing Mechanisms

  • Authors:
  • Clarissa Cassales Marquezan;André Panisson;Lisandro Zambenedetti Granville;Giorgio Nunzi;Marcus Brunner

  • Affiliations:
  • Federal University of Rio Grande do Sul, Porto Alegre, Brazil and NEC Europe Network Laboratories, Heidelberg, Germany;Federal University of Rio Grande do Sul, Porto Alegre, Brazil;Federal University of Rio Grande do Sul, Porto Alegre, Brazil;NEC Europe Network Laboratories, Heidelberg, Germany;NEC Europe Network Laboratories, Heidelberg, Germany

  • Venue:
  • DSOM '08 Proceedings of the 19th IFIP/IEEE international workshop on Distributed Systems: Operations and Management: Managing Large-Scale Service Deployment
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Monitoring is essential in modern network management. However, current monitoring systems are unable to recover their internal faulty entities forcing the network administrator to manually fix the occasionally broken monitoring solution. In this paper we address this issue by introducing a self-healing monitoring solution. This solution is described considering a scenario of a monitoring system for a Network Access Control (NAC) installation. The proposed solution combines the availability provided by P2P-based overlays with self-healing abilities. This paper also describes a set of experimental evaluations whose results present the tradeoff between the time required to recover the monitoring infrastructure when failures occur, and the associated bandwidth consumed in this process. Based on the experiments we show that it is possible to improve availability and robustness with minimum human intervention.