Optimizing collective I/O performance on parallel computers: a multisystem study
ICS '97 Proceedings of the 11th international conference on Supercomputing
Performance tuning and evaluation of a parallel community climate model
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Performance evaluation of the IBM SP and the Compaq AlphaServer SC
Proceedings of the 14th international conference on Supercomputing
IEEE Transactions on Parallel and Distributed Systems
FOAM: expanding the horizons of climate modeling
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Scaling the unscalable: a case study on the AlphaServer SC
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
A framework for performance modeling and prediction
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 3 - Volume 04
Performance Portability in the Physical Parameterizations of the Community Atmospheric Model
International Journal of High Performance Computing Applications
Efficient Parallel I/O in Community Atmosphere Model (CAM)
International Journal of High Performance Computing Applications
Algorithm 888: Spherical Harmonic Transform Algorithms
ACM Transactions on Mathematical Software (TOMS)
Parallelization of a Vlasov-Maxwell solver in four-dimensional phase space
Parallel Computing
An evaluation of MPI and OpenMP paradigms for multi-dimensional data remapping
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
Hi-index | 0.00 |
The spectral transform method is a standard numerical technique for solving partial differential equations on a sphere and is widely used in atmospheric circulation models. Recent research has identified several promising algorithms for implementing this method on massively parallel computers; however, no detailed comparison of the different algorithms has previously been attempted. In this paper, we describe these different parallel algorithms and report on computational experiments that we have conducted to evaluate their efficiency on parallel computers. The experiments used a testbed code that solves the nonlinear shallow water equations on a sphere; considerable care was taken to ensure that the experiments provide a fair comparison of the different algorithms and that the results are relevant to global models. We focus on hypercube- and mesh-connected multicomputers with cut-through routing, such as the Intel iPSC/860, DELTA, and Paragon, and the nCUBE/2, but we also indicate how the results extend to other parallel computer architectures. The results of this study are relevant not only to the spectral transform method but also to multidimensional fast Fourier transforms (FFTs) and other parallel transforms.