We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluster based on NVIDIA Fermi processors. We analyze how to optimize the algorithm for GP-GPU architectures, describe our implementation choices, and compare our performance results with those of an implementation optimized for latest-generation multi-core CPUs. Our program runs at ≈30% of the double-precision peak performance of one GPU and scales almost linearly when run on the multi-GPU cluster.
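The two figures of merit quoted above, fraction of peak and scaling with GPU count, can be made concrete with a small sketch. The helper names and every number below are hypothetical placeholders chosen for illustration, not measurements from this work, and the efficiency formula assumes a strong-scaling test (fixed problem size), which the abstract does not specify.

```python
def fraction_of_peak(sustained_gflops: float, peak_gflops: float) -> float:
    """Ratio of sustained to theoretical peak floating-point throughput."""
    if peak_gflops <= 0:
        raise ValueError("peak_gflops must be positive")
    return sustained_gflops / peak_gflops


def parallel_efficiency(t_one: float, t_n: float, n_gpus: int) -> float:
    """Strong-scaling efficiency: speedup on n_gpus devices divided by n_gpus.

    A value of 1.0 means perfectly linear scaling.
    """
    if t_n <= 0 or n_gpus <= 0:
        raise ValueError("t_n and n_gpus must be positive")
    speedup = t_one / t_n
    return speedup / n_gpus


# Hypothetical example values (assumptions, not data from the paper):
# a device with a 500 GFLOPS double-precision peak sustaining 150 GFLOPS.
peak_ratio = fraction_of_peak(150.0, 500.0)

# Hypothetical wall-clock times: 100 s on one GPU, 13 s on eight GPUs.
efficiency = parallel_efficiency(100.0, 13.0, 8)

print(f"fraction of peak: {peak_ratio:.2f}")
print(f"parallel efficiency on 8 GPUs: {efficiency:.2f}")
```

Under these assumed inputs, a fraction of peak near 0.30 corresponds to the "≈30%" figure, and an efficiency close to 1.0 is what "almost linear scaling" means in practice.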