Accelerating a paricle -in-cell simulation using a hybrid counting sort
Journal of Computational Physics
Plasma Physics Via Computer
Introduction to Algorithms
Optimization of PIC codes by improved memory management
Journal of Computational Physics
Fast multipole methods on graphics processors
Journal of Computational Physics
A Particle-Mesh Integrator for Galactic Dynamics Powered by GPGPUs
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Memory-efficient optimization of Gyrokinetic particle-to-grid interpolation for multicore processors
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Accelerating POCS interpolation of 3D irregular seismic data with Graphics Processing Units
Computers & Geosciences
Journal of Real-Time Image Processing
Particle-in-cell simulations with charge-conserving current deposition on graphic processing units
Journal of Computational Physics
Journal of Computational Physics
Fine-grained parallelization of a Vlasov-poisson application on GPU
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Journal of Computational Physics
Radiative signatures of the relativistic Kelvin-Helmholtz instability
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
ANCS '13 Proceedings of the ninth ACM/IEEE symposium on Architectures for networking and communications systems
Hi-index | 0.01 |
Particle-In-Cell (PIC) methods have been widely used for plasma physics simulations in the past three decades. To ensure an acceptable level of statistical accuracy relatively large numbers of particles are needed. State-of-the-art Graphics Processing Units (GPUs), with their high memory bandwidth, hundreds of SPMD processors, and half-a-teraflop performance potential, offer a viable alternative to distributed memory parallel computers for running medium-scale PIC plasma simulations on inexpensive commodity hardware. In this paper, we present an overview of a typical plasma PIC code and discuss its GPU implementation. In particular we focus on fast algorithms for the performance bottleneck operation of Particle-To-Grid interpolation.