A fast algorithm for particle simulations
Journal of Computational Physics
A parallel hashed Oct-Tree N-body algorithm
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Skeletons from the treecode closet
Journal of Computational Physics
Parallel methods for integrating ordinary differential equations
Communications of the ACM
Analysis of the Parareal Time-Parallel Time-Integration Method
SIAM Journal on Scientific Computing
Journal of Computational Physics
The International Exascale Software Project roadmap
International Journal of High Performance Computing Applications
Journal of Computational Physics
Scheduling of tasks in the parareal algorithm
Parallel Computing
Event-based parareal: A data-flow based implementation of parareal
Journal of Computational Physics
Parallel bucket-brigade communication interface for scientific applications
Proceedings of the 20th European MPI Users' Group Meeting
Hi-index | 0.00 |
We present a novel space-time parallel version of the Barnes-Hut tree code pepc using pfasst, the Parallel Full Approximation Scheme in Space and Time. The naive use of increasingly more processors for a fixed-size N-body problem is prone to saturate as soon as the number of unknowns per core becomes too small. To overcome this intrinsic strong-scaling limit, we introduce temporal parallelism on top of pepc's existing hybrid MPI/PThreads spatial decomposition. Here, we use pfasst which is based on a combination of the iterations of the parallel-in-time algorithm parareal with the sweeps of spectral deferred correction (SDC) schemes. By combining these sweeps with multiple space-time discretization levels, pfasst relaxes the theoretical bound on parallel efficiency in parareal. We present results from runs on up to 262,144 cores on the IBM Blue Gene/P installation JUGENE, demonstrating that the space-time parallel code provides speedup beyond the saturation of the purely space-parallel approach.