Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns

  • Authors:
  • Marc-André Hermanns;Markus Geimer;Bernd Mohr;Felix Wolf

  • Affiliations:
  • Jülich Supercomputing Centre, Forschungszentrum Jülich, Germany;Jülich Supercomputing Centre, Forschungszentrum Jülich, Germany;Jülich Supercomputing Centre, Forschungszentrum Jülich, Germany;Jülich Supercomputing Centre, Forschungszentrum Jülich, Germany and Department of Computer Science, RWTH Aachen University, Germany

  • Venue:
  • Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Wait states in parallel applications can be identified by scanning event traces for characteristic patterns. In our earlier work, we have defined such patterns for mpi -2 one-sided communication, although still based on a trace-analysis scheme with limited scalability. Taking advantage of a new scalable trace-analysis approach based on a parallel replay, which was originally developed for mpi -1 point-to-point and collective communication, we show how wait states in one-sided communications can be detected in a more scalable fashion. We demonstrate the scalability of our method and its usefulness for the optimization cycle with applications running on up to 8,192 cores.