In this paper we present a new parallelization scheme for the near-field part of the fast multipole method (FMM). The parallelization is based on the Global Arrays Toolkit and uses overlapping one-sided communication. It employs a purely static load-balancing approach to minimize the number of communication steps and makes maximum use of data locality. In contrast to other implementations, the communication is initiated by the process that owns the data (via a put call), not by the process that receives the data (via a get call).
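The owner-initiated put pattern described above can be illustrated with a small, single-process simulation. This is only a sketch under stated assumptions, not the scheme from the paper: the boxes are laid out in one dimension, each box interacts only with its two direct neighbors, the partition is a plain contiguous block split, and the "put" is an ordinary dictionary write rather than a Global Arrays call. All function names (`static_partition`, `put_schedule`, `run_puts`) are hypothetical.

```python
from collections import defaultdict


def static_partition(n_boxes, n_procs):
    """Assign contiguous ranges of box indices to processes.

    The assignment is computed once and never changes (static load balancing).
    """
    base, extra = divmod(n_boxes, n_procs)
    owner = []
    for p in range(n_procs):
        owner.extend([p] * (base + (1 if p < extra else 0)))
    return owner


def neighbors_1d(box, n_boxes):
    """Near-field neighbors of a box in a 1D toy layout (assumption)."""
    return [b for b in (box - 1, box + 1) if 0 <= b < n_boxes]


def put_schedule(owner):
    """For each owning process, collect the (destination, box) pairs it must push.

    Because the partition is static, every owner can derive this list locally,
    without first receiving a request from the destination.
    """
    n_boxes = len(owner)
    schedule = defaultdict(set)
    for box in range(n_boxes):
        for nb in neighbors_1d(box, n_boxes):
            if owner[nb] != owner[box]:
                schedule[owner[box]].add((owner[nb], box))
    return schedule


def run_puts(owner, data):
    """Simulate the puts: each owner writes its boundary boxes into the
    halo buffer of the process that needs them."""
    n_procs = max(owner) + 1
    halo = [dict() for _ in range(n_procs)]
    for src, puts in put_schedule(owner).items():
        for dest, box in sorted(puts):
            halo[dest][box] = data[box]  # stands in for a one-sided put
    return halo


if __name__ == "__main__":
    owner = static_partition(8, 3)
    data = [f"particles_{b}" for b in range(8)]
    for rank, received in enumerate(run_puts(owner, data)):
        print(rank, sorted(received))
```

In a get-based design, each receiver would instead look up the owner of every remote neighbor box and fetch it. With a static partition, the owner already knows who needs its boundary boxes, so it can push them as soon as they are ready.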