Linear algebra operators for GPU implementation of numerical algorithms
ACM SIGGRAPH 2003 Papers
Sparse matrix solvers on the GPU: conjugate gradients and multigrid
ACM SIGGRAPH 2003 Papers
Brook for GPUs: stream computing on graphics hardware
ACM SIGGRAPH 2004 Papers
Lattice-Based Flow Field Modeling
IEEE Transactions on Visualization and Computer Graphics
Dispersion Simulation and Visualization For Urban Security
VIS '04 Proceedings of the conference on Visualization '04
GPU Cluster for High Performance Computing
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
An improved study of real-time fluid simulation on GPU: Research Articles
Computer Animation and Virtual Worlds - Special Issue: The Very Best Papers from CASA 2004
MoXi: real-time ink dispersion in absorbent paper
ACM SIGGRAPH 2005 Papers
Simulation of miscible binary mixtures based on lattice Boltzmann method: Research Articles
Computer Animation and Virtual Worlds - CASA 2006
Asymptotic analysis of the lattice Boltzmann equation
Journal of Computational Physics
Visual Simulation of Heat Shimmering and Mirage
IEEE Transactions on Visualization and Computer Graphics
Accelerating geoscience and engineering system simulations on graphics hardware
Computers & Geosciences
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU
Proceedings of the 37th annual international symposium on Computer architecture
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Application of Lattice Boltzmann Method to sensitivity analysis via complex differentiation
Journal of Computational Physics
Advances in Engineering Software
Free surface flow simulations on GPGPUs using the LBM
Computers & Mathematics with Applications
Multi-thread implementations of the lattice Boltzmann method on non-uniform grids for CPUs and GPUs
Computers & Mathematics with Applications
A new approach to the lattice Boltzmann method for graphics processing units
Computers & Mathematics with Applications
The TheLMA project: Multi-GPU implementation of the lattice Boltzmann method
International Journal of High Performance Computing Applications
The Journal of Supercomputing
Combining lattice Boltzmann and discrete element methods on a graphics processor
International Journal of High Performance Computing Applications
A Multi-GPU implementation of a d2q37 lattice boltzmann code
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Simulation of surface fire fronts using fireLib and GPUs
Environmental Modelling & Software
On enhanced non-linear free surface flow simulations with a hybrid LBM-VOF model
Computers & Mathematics with Applications
Multi-GPU implementation of the lattice Boltzmann method
Computers & Mathematics with Applications
Computers & Mathematics with Applications
Cellular Automata and GPGPU: An Application to Lava Flow Modeling
International Journal of Grid and High Performance Computing
A lattice Boltzmann method for immiscible two-phase Stokes flow with a local collision operator
Computers & Mathematics with Applications
Efficient GPU implementation of the linearly interpolated bounce-back boundary condition
Computers & Mathematics with Applications
Towards energy-efficient parallel analysis of neural signals
Cluster Computing
Computers & Mathematics with Applications
GPU accelerated lattice Boltzmann simulation for rotational turbulence
Computers & Mathematics with Applications
Recent progress and challenges in exploiting graphics processors in computational fluid dynamics
The Journal of Supercomputing
Hi-index | 0.00 |
A very efficient implementation of a lattice Boltzmann (LB) kernel in 3D on a graphical processing unit using the compute unified device architecture interface developed by nVIDIA is presented. By exploiting the explicit parallelism offered by the graphics hardware, we obtain an efficiency gain of up to two orders of magnitude with respect to the computational performance of a PC. A non-trivial example shows the performance of the LB implementation, which is based on a D3Q13 model that is described in detail.