3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Free surface flow simulations on GPGPUs using the LBM
Computers & Mathematics with Applications
The TheLMA project: Multi-GPU implementation of the lattice Boltzmann method
International Journal of High Performance Computing Applications
Fast ARFTIS reconstruction algorithms using CUDA
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Computers & Mathematics with Applications
A Multi-GPU implementation of a d2q37 lattice boltzmann code
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
On enhanced non-linear free surface flow simulations with a hybrid LBM-VOF model
Computers & Mathematics with Applications
Computers & Mathematics with Applications
Cellular Automata and GPGPU: An Application to Lava Flow Modeling
International Journal of Grid and High Performance Computing
GPU implementation of a novel hybrid lattice Boltzmann method for non-isothermal flows
Proceedings of the 5th ACM COMPUTE Conference: Intelligent & scalable system technologies
Hi-index | 0.00 |
In this article a very efficient implementation of a 2D-Lattice Boltzmann kernel using the Compute Unified Device Architecture (CUDA™) interface developed by nVIDIA® is presented. By exploiting the explicit parallelism exposed in the graphics hardware we obtain more than one order in performance gain compared to standard CPUs. A non-trivial example, the flow through a generic porous medium, shows the performance of the implementation.