Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems

  • Authors:
  • D. M. Ogle;K. Schwan;Richard Thomas Snodgrass

  • Affiliations:
  • -;-;-

  • Venue:
  • IEEE Transactions on Parallel and Distributed Systems
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

Achieving high performance for parallel or distributed programs often requires substantial amounts of information about the programs themselves, about the systems on which they are executing, and about specific program runs. The monitoring system that collects, analyzes, and makes application-dependent monitoring information available to the programmer and to the executing program is presented. The system may be used foroff-line program analysis, for on-line debugging, and for making on-line, dynamic changes to parallel or distributed programs to enhance their performance. The authors use a high-level, uniform data model for the representation of program information andmonitoring data. They show how this model may be used for the specification of programviews and attributes for monitoring, and demonstrate how such specifications can betranslated into efficient, program-specific monitoring code that uses alternativemechanisms for the distributed analysis and collection to be performed for the specifiedviews. The model's utility has been demonstrated on a wide variety of parallel machines.