Scalable parallel trace-based performance analysis

  • Authors:
  • Markus Geimer;Felix Wolf;Brian J. N. Wylie;Bernd Mohr

  • Affiliations:
  • John von Neumann Institute for Computing (NIC), Forschungszentrum Jülich, Jülich, Germany;John von Neumann Institute for Computing (NIC), Forschungszentrum Jülich, Jülich, Germany;John von Neumann Institute for Computing (NIC), Forschungszentrum Jülich, Jülich, Germany;John von Neumann Institute for Computing (NIC), Forschungszentrum Jülich, Jülich, Germany

  • Venue:
  • EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic trace analysis is an effective method for identifying complex performance phenomena in parallel applications. However, as the size of parallel systems and the number of processors used by individual applications is continuously raised, the traditional approach of analyzing a single global trace file, as done by kojak's expert trace analyzer, becomes increasingly constrained by the large number of events. In this article, we present a scalable version of the expert analysis based on analyzing separate local trace files with a parallel tool which ‘replays' the target application's communication behavior. We describe the new parallel analyzer architecture and discuss first empirical results.