A fast algorithm for particle simulations
Journal of Computational Physics
Implications of hierarchical N-body methods for multiprocessor architectures
ACM Transactions on Computer Systems (TOCS)
Fast parallel algorithms for short-range molecular dynamics
Journal of Computational Physics
Journal of Parallel and Distributed Computing
Fast Fourier Transform Accelerated Fast Multipole Algorithm
SIAM Journal on Scientific Computing
Journal of Computational Physics
The parallel fast multipole method in molecular dynamics
The parallel fast multipole method in molecular dynamics
High performance Fortran for highly irregular problems
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
A fast adaptive multipole algorithm in three dimensions
Journal of Computational Physics
A data-parallel implementation of O(N) hierarchical N-body methods
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Highly portable and efficient implementations of parallel adaptive N-body methods
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
A new version of the fast multipole method for screened Coulomb interactions in three dimensions
Journal of Computational Physics
An overview of the BlueGene/L Supercomputer
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
NAMD: biomolecular simulation on thousands of processors
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Efficient parallel implementations of multipole based n-body algorithms
Efficient parallel implementations of multipole based n-body algorithms
Virtual memory on data diffusion architectures
Parallel Computing
Communications overlapping in fast multipole particle dynamics methods
Journal of Computational Physics
Automatic Generation of FFT for Translations of Multipole Expansions in Spherical Harmonics
International Journal of High Performance Computing Applications
Fast electrostatic force calculation on parallel computer clusters
Journal of Computational Physics
Latency-Optimized Parallelization of the FMM Near-Field Computations
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
A massively parallel adaptive fast-multipole method on heterogeneous architectures
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
A massively parallel adaptive fast multipole method on heterogeneous architectures
Communications of the ACM
A CPU: GPU Hybrid Implementation and Model-Driven Scheduling of the Fast Multipole Method
Proceedings of Workshop on General Purpose Processing Using GPUs
Hi-index | 0.02 |
We present a new load balanced parallel implementation of a non-adaptive version of Greengard and Rokhlin's fast multipole method for distributed memory architectures with focus on applications in molecular dynamics. We introduce a novel load balancing and communication overlapping scheme. Our implementation includes periodic boundary conditions calculations and facilitates multiple time stepping techniques without sacrificing determinism of computation and scales to hundreds of processor for systems of only O(10k) atoms.