Grid infrastructure monitoring system based on Nagios

  • Authors:
  • Emir Imamagic;Dobrisa Dobrenic

  • Affiliations:
  • University of Zagreb;University of Zagreb

  • Venue:
  • Proceedings of the 2007 workshop on Grid monitoring
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Monitoring in distributed environment such as a grid is crucial for normal operation of all subsystems. Constant gathering of information enables efficient security auditing, failure detection, maintenance, job scheduling, accounting, resource performance tuning, debugging, etc. In this paper we focus on monitoring of resources in the grid with the purpose of failure detection, notifications and automatic recovery. We introduce our system based on open source monitoring framework Nagios that achieves these functionalities. We describe grid specific features we implemented in order to achieve efficient grid monitoring system, namely sensors for various grid services, advanced sensor hierarchy and certificate-based authorization on web interface. Finally, we give overview of the implementation of our system for monitoring EGEE grid infrastructure.