A preliminary evaluation of the hardware acceleration of the Cray Gemini interconnect for PGAS languages and comparison with MPI

  • Authors:
  • Hongzhang Shan;Nicholas J. Wright;John Shalf;Katherine Yelick;Marcus Wagner;Nathan Wichmann

  • Affiliations:
  • CRD and NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA;CRD and NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA;CRD and NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA;CRD and NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA;Cray Inc., 380 Jackson Street, St. Paul, MN;Cray Inc., 380 Jackson Street, St. Paul, MN

  • Venue:
  • ACM SIGMETRICS Performance Evaluation Review
  • Year:
  • 2012

Abstract

The Gemini interconnect on the Cray XE6 platform provides lightweight remote direct memory access (RDMA) between nodes, which is useful for implementing partitioned global address space (PGAS) languages such as UPC and Co-Array Fortran. In this paper, we study Gemini performance using a set of communication microbenchmarks and compare the performance of one-sided communication in PGAS languages with that of two-sided MPI. Our results demonstrate the performance benefits of the PGAS model on Gemini hardware, showing in what circumstances and by how much one-sided communication outperforms two-sided communication in terms of messaging rate, aggregate bandwidth, and the capability to overlap computation with communication. For example, for 8-byte and 2KB messages, the one-sided messaging rate is 5 and 10 times greater, respectively, than the two-sided rate. The study also reveals important information about how to optimize one-sided Gemini communication.