Predictive performance and scalability modeling of a large-scale application
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Dynamic Load Balancing for Structured Adaptive Mesh Refinement Applications
ICPP '02 Proceedings of the 2001 International Conference on Parallel Processing
Dynamic Load Balancing for Parallel Adaptive Mesh Refinement
IRREGULAR '98 Proceedings of the 5th International Symposium on Solving Irregularly Structured Problems in Parallel
On Partitioning Dynamic Adaptive Grid Hierarchies
HICSS '96 Proceedings of the 29th Hawaii International Conference on System Sciences Volume 1: Software Technology and Architecture
Hierarchical Partitioning Techniques for Structured Adaptive Mesh Refinement Applications
The Journal of Supercomputing
ACM SIGMETRICS Performance Evaluation Review - Special issue on the 1st international workshop on performance modeling, benchmarking and simulation of high performance computing systems (PMBS 10)
Exascale computing technology challenges
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
A Simulator for Large-Scale Parallel Computer Architectures
International Journal of Distributed Systems and Technologies
Semi-automatic extraction of software skeletons for benchmarking large-scale parallel applications
Proceedings of the 2013 ACM SIGSIM conference on Principles of advanced discrete simulation
Validation and uncertainty assessment of extreme-scale HPC simulation through bayesian inference
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
The ability to predict the performance of irregular, asynchronous applications on future hardware is essential to the exascale co-design process. Adaptive Mesh Refinement (AMR) applications are inherently irregular and dynamic in their computation and communication patterns, resulting in complex hardware/software interactions. We have developed a methodology to use architectural simulators to assess the performance of different AMR data placement strategies on a selection of potential hardware interconnect topologies for exascale-class supercomputers. We use our framework to study the CASTRO AMR compressible astrophysics code for the simulation of supernovae. The results show a performance improvement of up to 18 percent may be obtained through the use of locality-aware data distributions for some network topologies on an exascale-class supercomputer.