Fast Poisson solvers based on fast Fourier transform (FFT) computations are among the fastest techniques for solving the Poisson equation on uniform grids. In this paper, the authors present two parallel distributed implementations of a 3-D fast Poisson solver in the context of an atmospheric simulation code. The first approach moves data between computational steps so that no elementary computational routine involves communication. The second approach aims to reduce global data movement and requires parallelizing one step of the fast Poisson solver. Experimental results on a 128-node Cray T3E illustrate the advantages and drawbacks of each approach.
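For readers unfamiliar with the underlying technique, the following is a minimal serial sketch of an FFT-based Poisson solver, not the authors' parallel implementation. It assumes periodic boundary conditions in all three directions and a spectral (exact-derivative) treatment of the Laplacian; the abstract specifies neither the boundary conditions nor the discretization used in the paper. The function name `solve_poisson_periodic` and its parameters are hypothetical, chosen for illustration only.

```python
import numpy as np

def solve_poisson_periodic(f, lengths):
    """Solve laplacian(u) = f on a periodic uniform grid.

    Assumptions (not taken from the paper): periodic boundaries,
    spectral Laplacian, and a zero-mean solution u.
    """
    f = np.asarray(f, dtype=float)
    # Angular wavenumbers along each axis of the uniform grid.
    ks = [2.0 * np.pi * np.fft.fftfreq(n, d=L / n)
          for n, L in zip(f.shape, lengths)]
    grids = np.meshgrid(*ks, indexing="ij")
    k2 = sum(k ** 2 for k in grids)

    # Forward transform: the Laplacian becomes multiplication by -|k|^2.
    f_hat = np.fft.fftn(f)

    zero_mode = (0,) * f.ndim
    k2[zero_mode] = 1.0          # avoid dividing by zero at k = 0
    u_hat = -f_hat / k2
    u_hat[zero_mode] = 0.0       # fix the free constant: zero-mean solution

    # Inverse transform back to physical space.
    return np.fft.ifftn(u_hat).real
```

The multidimensional transform here is a sequence of independent 1-D FFTs along each axis. In a distributed setting, that separability is what allows the data to be redistributed between steps so that each 1-D transform runs on locally held data, as in the first approach described above.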