Locality and topology aware intra-node communication among multicore CPUs

  • Authors:
  • Teng Ma;George Bosilca;Aurelien Bouteiller;Jack J. Dongarra

  • Affiliations:
  • Innovative Computing Laboratory, University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, University of Tennessee Computer Science Department, Knoxville, TN

  • Venue:
  • EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A major trend in HPC is the escalation toward manycore, where systems are composed of shared memory nodes featuring numerous processing units. Unfortunately, with scale comes complexity, here in the form of non-uniform memory accesses and cache hierarchies. For most HPC applications, harnessing the power of multicores is hindered by the topology oblivious tuning of the MPI library. In this paper, we propose a framework to tune every type of shared memory communications according to locality and topology. An implementation inside Open MPI is evaluated experimentally and demonstrates significant speedups compared to vanilla Open MPI and MPICH2.