Benchmarking of communication techniques for GPUs

Authors:
M. Bernaschi;M. Bisson;D. Rossetti
Affiliations:
Istituto per le Applicazioni del Calcolo, National Research Council of Italy, Italy;Istituto per le Applicazioni del Calcolo, National Research Council of Italy, Italy;INFN, Roma, Italy
Venue:
Journal of Parallel and Distributed Computing
Year:
2013

Citing 2
Cited 0

Overlapping Computation and Communication for Advection on Hybrid Parallel Computers

IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Performance potential for simulating spin models on GPU

Journal of Computational Physics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We report about the performances obtained, at the application level, by two MPI implementations for Infiniband that allow direct exchange of data stored in the global memory of Graphic Processing Units (GPU) based on the Nvidia CUDA. For the same purpose, we tested also the Application Programming Interface of APEnet, which is a custom, high performance interconnect technology. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the over-relaxation algorithm. The results show that CUDA streams are instrumental in achieving the best possible performances.