Latency-Optimized Parallelization of the FMM Near-Field Computations

  • Authors:
  • Ivo Kabadshow; Bruno Lang

  • Affiliations:
  • John von Neumann Institute for Computing, Central Institute for Applied Mathematics, Research Centre Jülich, Germany; Applied Computer Science and Scientific Computing Group, Department of Mathematics, University of Wuppertal, Germany

  • Venue:
  • ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
  • Year:
  • 2007

Abstract

In this paper we present a new parallelization scheme for the FMM near-field computations. The parallelization is based on the Global Arrays Toolkit and uses one-sided communication with overlapping. It employs a purely static load-balancing approach to minimize the number of communication steps and benefits from maximum utilization of data locality. In contrast to other implementations, the communication is initiated by the process owning the data (via a put call), not by the process receiving the data (via a get call).
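
The owner-initiated idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation and does not use the Global Arrays API; it is a plain Python simulation under simplifying assumptions: boxes lie on a 1D line, the near field of a box is the box itself and its two direct neighbors, and boxes are assigned to processes in fixed contiguous blocks. The names `static_owner`, `neighbors`, `owner_initiated_puts`, and `inbox` are hypothetical.

```python
def static_owner(box, n_boxes, n_procs):
    """Static block distribution: the owner of each box is fixed up front."""
    block = -(-n_boxes // n_procs)  # ceiling division
    return box // block


def neighbors(box, n_boxes):
    """Assumed 1D near-field stencil: the box and its direct neighbors."""
    return [b for b in (box - 1, box, box + 1) if 0 <= b < n_boxes]


def owner_initiated_puts(n_boxes, n_procs, data):
    """Simulate owners pushing box data to every remote process that needs it."""
    # inbox[p] stands in for the memory of process p that remote owners write to.
    inbox = [dict() for _ in range(n_procs)]
    puts = 0
    for box in range(n_boxes):
        src = static_owner(box, n_boxes, n_procs)
        # Any process owning a neighbor box needs this box for its near field.
        targets = {static_owner(nb, n_boxes, n_procs)
                   for nb in neighbors(box, n_boxes)}
        for dst in targets - {src}:
            inbox[dst][box] = data[box]  # "put": written without a request from dst
            puts += 1
    return inbox, puts


if __name__ == "__main__":
    data = [[float(b)] for b in range(8)]  # placeholder per-box payload
    inbox, puts = owner_initiated_puts(8, 2, data)
    print(puts, inbox)
```

Because the distribution is static, every owner can work out in advance which remote processes need its boxes, so in this sketch no receiver ever has to ask for data. In a get-based scheme the receiver would first have to issue a request to the owner before any data moves.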