Parallel programming in OpenMP
Parallel programming in OpenMP
MPI: The Complete Reference
CUDA by Example: An Introduction to General-Purpose GPU Programming
CUDA by Example: An Introduction to General-Purpose GPU Programming
Astrophysical particle simulations with large custom GPU clusters on three continents
Computer Science - Research and Development
OpenCL Programming Guide
Hi-index | 31.45 |
We present a new implementation of the numerical integration of the classical, gravitational, N-body problem based on a high order Hermite's integration scheme with block time steps, with a direct evaluation of the particle-particle forces. The main innovation of this code (called HiGPUs) is its full parallelization, exploiting both OpenMP and MPI in the use of the multicore Central Processing Units as well as either Compute Unified Device Architecture (CUDA) or OpenCL for the hosted Graphic Processing Units. We tested both performance and accuracy of the code using up to 256GPUs in the supercomputer IBM iDataPlex DX360M3 Linux Infiniband Cluster provided by the Italian supercomputing consortium CINECA, for values of N=