Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application

  • Authors:
  • Sreeram Potluri;Ping Lai;Karen Tomko;Sayantan Sur;Yifeng Cui;Mahidhar Tatineni;Karl W. Schulz;William L. Barth;Amitava Majumdar;Dhabhaleswar K. Panda

  • Affiliations:
  • The Ohio State University, Columbus, OH;The Ohio State University, Columbus, OH;Ohio Supercomputer Center, Columbus, OH;The Ohio State University, Columbus, OH;San Diego Supercomputer Center, San Diego, California;San Diego Supercomputer Center, San Diego, California;Texas Advanced Computing Center, Austin, Texas;Texas Advanced Computing Center, Austin, Texas;San Diego Supercomputer Center, San Diego, California;The Ohio State University, Columbus, OH

  • Venue:
  • Proceedings of the 24th ACM International Conference on Supercomputing
  • Year:
  • 2010

Abstract

AWM-Olsen is a widely used ground motion simulation code based on a parallel finite difference solution of the 3-D velocity-stress wave equation. The application runs on tens of thousands of cores and consumes several million CPU hours on the TeraGrid clusters every year. A significant portion of its run time (37% in a 4,096-process run) is spent in MPI communication routines. Good performance therefore demands an optimized communication design coupled with a low-latency, high-bandwidth network and an efficient communication subsystem. In this paper, we analyze the performance bottlenecks of the application with regard to the time spent in MPI communication calls. We find that much of this time can be overlapped with computation using MPI non-blocking calls. We use both two-sided and MPI-2 one-sided communication semantics to redesign the communication in AWM-Olsen. With our new design using MPI-2 one-sided communication semantics, the entire application is sped up by 12% at 4K processes and by 10% at 8K processes on Ranger, a state-of-the-art InfiniBand cluster at the Texas Advanced Computing Center (TACC).
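
To put the figures in the abstract in perspective, the following is a minimal back-of-the-envelope sketch, not the authors' analysis. It assumes a simple model in which hiding a fraction of the communication time behind computation removes exactly that time from the run, with no added overhead. The function names are hypothetical, and because the abstract does not say whether "sped up by 12%" means a 12% shorter run time or a speedup factor of 1.12, both readings are computed.

```python
# Simple overlap model (an assumption, not taken from the paper):
# normalized run time = 1 - comm_fraction * hidden_fraction


def runtime_after_overlap(comm_fraction, hidden_fraction):
    """Normalized run time (original = 1.0) when `hidden_fraction` of the
    communication time is overlapped with computation."""
    return 1.0 - comm_fraction * hidden_fraction


def hidden_fraction_for_reduction(comm_fraction, runtime_reduction):
    """Share of communication time that must be hidden to shorten the
    run by `runtime_reduction` (as a fraction of the original run time)."""
    return runtime_reduction / comm_fraction


# From the abstract: 37% of run time is in MPI calls at 4,096 processes.
COMM_FRACTION = 0.37

# Upper bound under this model: all communication is hidden.
best_case_runtime = runtime_after_overlap(COMM_FRACTION, 1.0)
max_speedup = 1.0 / best_case_runtime

# Reading 1 (assumption): "12%" is a 12% reduction in run time.
hidden_if_reduction = hidden_fraction_for_reduction(COMM_FRACTION, 0.12)

# Reading 2 (assumption): "12%" is a speedup factor of 1.12.
hidden_if_speedup = hidden_fraction_for_reduction(
    COMM_FRACTION, 1.0 - 1.0 / 1.12
)

print(f"best-case normalized run time: {best_case_runtime:.2f}")
print(f"best-case speedup factor:      {max_speedup:.2f}")
print(f"hidden share, reading 1:       {hidden_if_reduction:.2f}")
print(f"hidden share, reading 2:       {hidden_if_speedup:.2f}")
```

Under either reading, the reported gain corresponds to hiding roughly a third of the measured communication time in this model, which is consistent with the abstract's claim that much, though not all, of that time can be overlapped.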