Near-optimal placement of MPI processes on hierarchical NUMA architectures

  • Authors:
  • Emmanuel Jeannot; Guillaume Mercier

  • Affiliations:
  • LaBRI and INRIA Bordeaux Sud-Ouest; LaBRI and INRIA Bordeaux Sud-Ouest and Institut Polytechnique de Bordeaux

  • Venue:
  • Euro-Par'10: Proceedings of the 16th International Euro-Par Conference on Parallel Processing: Part II
  • Year:
  • 2010

Abstract

MPI process placement can play a decisive role in application performance. This is especially true on today's architectures (heterogeneous, multicore, with several levels of cache, etc.). In this paper, we describe a novel algorithm called TreeMatch that maps processes to resources in order to reduce the communication cost of the whole application. We have implemented this algorithm and discuss its performance both in simulation and on the NAS benchmarks.