dproc - Extensible Run-Time Resource Monitoring for Cluster Applications

  • Authors:
  • Jasmina Jancic;Christian Poellabauer;Karsten Schwan;Mathhew Wolf;Neil Bright

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • ICCS '02 Proceedings of the International Conference on Computational Science-Part II
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe the dproc (distributed /proc) kernellevel mechanisms and abstractions, which provide the building blocks for implementation of efficient, cluster-wide, and application-specific performance monitoring. Such monitoring functionality may be constructed at any time, both before and during application invocation, and can include dynamic run-time extensions. This paper (i) presents dproc's implementation in a Linux-based cluster of SMP-machines, and (ii) evaluates its utility by construction of sample monitoring functionality. Full version of this paper can be found at: http://www.cc.gatech.edu/systems/projects/dproc/