Improving Multi-million Virtual Rank MPI Execution in [MUPI]

  • Authors:
  • Kalyan S. Perumalla;Alfred J. Park

  • Affiliations:
  • -;-

  • Venue:
  • MASCOTS '11 Proceedings of the 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

MUPI is a parallel discrete event simulator designed for enabling software-based experimentation via simulated execution across a range of synthetic to unmodified parallel programs using the Message Passing Interface (MPI) with millions of tasks. Here, we report work in progress in improving the efficiency of 聝脢 . Among the issues uncovered are the scaling problems with implementing barriers and intertask message ordering. Preliminary performance shows the possibility of supporting hundreds of virtual MPI ranks per real processor core. Performance improvements of at least 2聛~ are observed, and enable execution of benchmark MPI runs with over 16 million virtual ranks synchronized in a discrete event fashion on as few as 16,128 real cores of a Cray XT5.