A systematic multi-step methodology for performance analysis of communication traces of distributed applications based on hierarchical clustering

  • Authors:
  • Gaby Aguilera; Patricia J. Teller; Michela Taufer; Felix Wolf

  • Affiliations:
  • University of Texas-El Paso, El Paso, TX; University of Texas-El Paso, El Paso, TX; University of Texas-El Paso, El Paso, TX; Forschungszentrum Jülich, Jülich, Germany

  • Venue:
  • IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
  • Year:
  • 2006

Abstract

Parallel scientific applications are often instrumented so that traces can be collected and analyzed to identify processes with performance problems or operations that delay program execution. Executing instrumented code can generate large amounts of performance data, and collecting, storing, and analyzing such traces is demanding in both time and space. To address this problem, this paper presents an efficient, systematic, multi-step methodology, based on hierarchical clustering, for analyzing communication traces of parallel scientific applications. The methodology is used to discover potential communication performance problems in three applications: TRACE, REMO, and SWEEP3D.
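
The abstract does not spell out the individual steps of the methodology, so the following is only a generic sketch of hierarchical (agglomerative) clustering applied to per-process communication metrics, not the authors' procedure. The feature choice (megabytes sent, seconds spent waiting in receives), the sample values, the average-linkage criterion, and the function name `agglomerative` are all assumptions made for illustration.

```python
from itertools import combinations
from math import dist


def agglomerative(points, num_clusters):
    """Average-linkage agglomerative clustering.

    Returns a list of clusters, each a sorted list of point indices.
    """
    # Start with every point in its own cluster.
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Average pairwise Euclidean distance between two clusters.
        total = sum(dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > num_clusters:
        # Find the closest pair of clusters (x < y) and merge them.
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[x] = sorted(clusters[x] + clusters[y])
        del clusters[y]
    return clusters


# Hypothetical per-process features: (MB sent, seconds waiting in receives).
features = [
    (10.0, 0.5),  # process 0
    (10.2, 0.6),  # process 1
    (9.9, 0.4),   # process 2
    (10.1, 4.8),  # process 3: waits far longer than its peers
]

groups = agglomerative(features, 2)
print(groups)  # [[0, 1, 2], [3]]
```

In this toy input, processes 0 to 2 behave alike and merge into one cluster, while process 3 remains on its own, which is the kind of outlier an analyst might inspect first. Grouping similar processes also suggests why clustering can reduce the volume of trace data that has to be examined in detail.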