The NetLogger Methodology for High Performance Distributed Systems Performance Analysis

  • Authors:
  • Brian Tierney;William Johnston;Brian Crowley;Gary Hoo;Chris Brooks;Dan Gunter

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe a methodology that enables the real-time diagnosis of performance problems in complex high-performance distributed systems. The methodology includes tools for generating precision event logs that can be used to provide detailed end-to-end application and system level monitoring; a Java agent-based system for managing the large amount of logging data; and tools for visualizing the log data and real-time state of the distributed system. We developed these tools for analyzing a high-performance distributed system centered around the transfer of large amounts of data at high speeds from a distributed storage server to a remote visualization client. However, this methodology should be generally applicable to any distributed system.This methodology, called NetLogger, has proven invaluable for diagnosing problems in networks and in distributed systems code. This approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system.