This paper proposes a new method for Particle-in-Cell (PIC) simulations that aims to achieve both good load balancing and scalability, so that the simulations run efficiently on distributed-memory systems. The method, named OhHelp, partitions the space domain in which the charged particles reside into equal subdomains and assigns one subdomain to each computation node, which makes the simulation scalable with respect to the size of the domain. Load balancing, and thus scalability with respect to the number of particles, is accomplished by making each node help another, heavily loaded node, which delegates to its helpers part of the particles in its subdomain together with a replica of the subdomain's field data. The OhHelp load balancer monitors particle movements across subdomain boundaries to check whether the helpand-helpers configuration still keeps the load well balanced and, when it finds unacceptable imbalance, dynamically reconfigures it to restore balance. The efficiency and scalability of OhHelp are confirmed through experiments with a production-level, full-3D plasma simulator using uniform and non-uniform particle distributions. We found that 256-core parallel simulations, including an extremely imbalanced setting in which all the particles are packed into a small region, achieve speedups of 159 to 190 over sequential execution.
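The helping idea described above can be illustrated with a minimal Python sketch. It is not the OhHelp algorithm itself: the function name `plan_helpers`, the `tolerance` parameter, and the greedy pairing rule are assumptions made for illustration only. Each node owns one subdomain with a known particle count, a node whose count exceeds the average by more than the tolerance acts as a helpand, and each under-loaded node helps at most one helpand by taking over part of its particles.

```python
def plan_helpers(counts, tolerance=0.1):
    """Greedy helper assignment (illustrative only, not the OhHelp algorithm).

    counts[i] is the number of particles in the subdomain owned by node i.
    Returns (plan, loads): plan maps helper -> (helpand, particles taken over),
    loads is the per-node particle load after helping.
    """
    n = len(counts)
    target = sum(counts) / n            # ideal load per node
    limit = target * (1 + tolerance)    # assumed acceptable upper bound

    # Nodes above the limit are helpands; track how far they are above target.
    excess = {i: c - target for i, c in enumerate(counts) if c > limit}
    # Nodes below the target can help; largest spare capacity first.
    spare = sorted(((target - c, i) for i, c in enumerate(counts) if c < target),
                   reverse=True)

    loads = list(counts)
    plan = {}
    for room, helper in spare:
        if not excess:
            break                       # nothing left to rebalance
        helpand = max(excess, key=excess.get)
        moved = int(min(room, excess[helpand]))
        if moved <= 0:
            continue
        plan[helper] = (helpand, moved)  # each helper serves one helpand
        loads[helper] += moved
        loads[helpand] -= moved
        excess[helpand] -= moved
        if loads[helpand] <= limit:
            del excess[helpand]
    return plan, loads


if __name__ == "__main__":
    # Extreme imbalance: every particle sits in node 0's subdomain.
    plan, loads = plan_helpers([1000, 0, 0, 0])
    print(plan)
    print(loads)
```

In this sketch the total number of particles is conserved and an already balanced input yields an empty plan. A real implementation would also have to replicate the helpand's field data on each helper and re-check the balance as particles cross subdomain boundaries, as the abstract describes.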