Automatic Transformation for Overlapping Communication and Computation

  • Authors:
  • Changjun Hu; Yewei Shao; Jue Wang; Jianjiang Li

  • Affiliations:
  • School of Information Engineering, University of Science and Technology Beijing, Beijing, P.R. China (all four authors)

  • Venue:
  • NPC '08 Proceedings of the IFIP International Conference on Network and Parallel Computing
  • Year:
  • 2008

Abstract

Message passing is a predominant programming paradigm for distributed-memory systems. RDMA networks such as InfiniBand and Myrinet reduce communication overhead by overlapping communication with computation. To make this overlap more effective, we propose a source-to-source transformation scheme that automatically restructures message-passing code. Extensions to the control-flow graph allow the message-passing program to be analyzed accurately and support effective data-flow analysis. This analysis identifies the minimal region between producer and consumer that contains the message-passing function calls. Using inter-procedural data-flow analysis, the transformation scheme enables communication to overlap with computation. Experiments on the well-known NAS Parallel Benchmarks show that, on distributed-memory systems, the versions employing communication-computation overlap run faster than the original programs.
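
The producer-consumer idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes a single straight-line block with one blocking call, ignores control flow, inter-procedural effects, and message ordering, and uses hypothetical names (`Stmt`, `overlap`) with schematic statement strings rather than real MPI signatures. The blocking call is split into a nonblocking start, hoisted to just after the nearest conflicting statement above it, and a wait, sunk to just before the nearest conflicting statement below it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stmt:
    """One statement, with the variables it defines and uses (hypothetical model)."""
    text: str
    defs: frozenset = frozenset()
    uses: frozenset = frozenset()


def overlap(block, i, kind, buf):
    """Split the blocking call at block[i] into a nonblocking start and a wait.

    kind is "send" or "recv"; buf is the message buffer's variable name.
    A send only reads buf, so it conflicts with statements that write buf.
    A recv writes buf, so it conflicts with statements that read or write buf.
    """
    if kind == "send":
        def conflict(s):
            return buf in s.defs
    else:
        def conflict(s):
            return buf in s.defs or buf in s.uses

    # Hoist the start to just after the last conflicting statement above.
    start = max((j + 1 for j in range(i) if conflict(block[j])), default=0)
    # Sink the wait to just before the first conflicting statement below.
    wait = min((k for k in range(i + 1, len(block)) if conflict(block[k])),
               default=len(block))

    # Schematic statement strings only, not real MPI call signatures.
    begin = Stmt(f"MPI_I{kind}({buf}, &req)")
    end = Stmt("MPI_Wait(&req)")
    return (block[:start] + [begin] + block[start:i]
            + block[i + 1:wait] + [end] + block[wait:])


block = [
    Stmt("a = compute_a()", defs=frozenset({"a"})),
    Stmt("b = compute_b()", defs=frozenset({"b"})),
    Stmt("MPI_Send(a)", uses=frozenset({"a"})),
    Stmt("c = f(b)", defs=frozenset({"c"}), uses=frozenset({"b"})),
    Stmt("a = g(c)", defs=frozenset({"a"}), uses=frozenset({"c"})),
]
out = [s.text for s in overlap(block, 2, "send", "a")]
for line in out:
    print(line)
```

In this example the send starts as soon as `a` has been produced and completes only before `a` is overwritten, so the computation of `b` and `c` lies inside the overlap region. A real transformation would also have to respect branches, loops, procedure boundaries, and the relative order of multiple messages.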