We present a distributed-memory, message-passing parallel implementation of a finite-volume discretization of the primitive equations in the Community Atmosphere Model. Because of the data dependences that result from the polar singularity of the latitude-longitude coordinate system, we employ two separate domain decompositions within the dynamical core: one in latitude/level space and the other in longitude/latitude space. The data must therefore be redistributed periodically between these two decompositions. In addition, the domains contain halo regions that cover the nearest-neighbor data dependences. We present a combination of techniques, including one-sided communication and multithreading, to optimize this data movement. The resulting algorithm is shown to scale to very large machine configurations, even at relatively coarse resolutions.
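The redistribution between the two decompositions can be illustrated with a small bookkeeping sketch. It is not the model's code: the function names (`block_ranges`, `overlap`, `redistribution_plan`, `points_moved`), the contiguous block partitioning, and the process-grid parameters are all assumptions made for illustration, and no actual message passing is performed. The sketch only computes which index block each source rank in the latitude/level layout would hand to each destination rank in the longitude/latitude layout.

```python
from itertools import product


def block_ranges(n, parts):
    """Split range(n) into `parts` contiguous, nearly equal blocks."""
    base, extra = divmod(n, parts)
    ranges, start = [], 0
    for p in range(parts):
        size = base + (1 if p < extra else 0)
        ranges.append(range(start, start + size))
        start += size
    return ranges


def overlap(a, b):
    """Intersection of two contiguous ranges (possibly empty)."""
    return range(max(a.start, b.start), min(a.stop, b.stop))


def redistribution_plan(nlon, nlat, nlev, p_lat, p_lev, q_lon, q_lat):
    """Map (source rank, destination rank) to the index block that moves.

    Source layout (assumed): latitude/level decomposition on a
    p_lat x p_lev process grid; each rank holds all longitudes.
    Destination layout (assumed): longitude/latitude decomposition on a
    q_lon x q_lat process grid; each rank holds all levels.
    """
    src_lat = block_ranges(nlat, p_lat)
    src_lev = block_ranges(nlev, p_lev)
    dst_lon = block_ranges(nlon, q_lon)
    dst_lat = block_ranges(nlat, q_lat)

    plan = {}
    sources = product(range(p_lat), range(p_lev))
    dests = product(range(q_lon), range(q_lat))
    for (sj, sk), (di, dj) in product(sources, dests):
        lats = overlap(src_lat[sj], dst_lat[dj])
        if len(lats) == 0:
            continue  # these two ranks share no latitudes
        # The source owns every longitude and the destination owns every
        # level, so the block is: destination longitudes x shared
        # latitudes x source levels.
        plan[((sj, sk), (di, dj))] = (dst_lon[di], lats, src_lev[sk])
    return plan


def points_moved(plan):
    """Total number of grid points covered by all blocks in the plan."""
    return sum(len(i) * len(j) * len(k) for i, j, k in plan.values())


if __name__ == "__main__":
    # Hypothetical grid and process-grid sizes, chosen only for the demo.
    plan = redistribution_plan(144, 96, 26, p_lat=4, p_lev=2, q_lon=4, q_lat=4)
    print(len(plan), "messages,", points_moved(plan), "grid points")
```

In this sketch, choosing `q_lat` equal to `p_lat` makes the latitude blocks of the two layouts coincide, so each rank exchanges data only with ranks that share its latitude band. Each entry of the plan could then correspond to one message or one one-sided put in a real implementation.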