High performance reactive fluid flow simulations using adaptive mesh refinement on thousands of processors

Authors:
A. C. Calder;B. C. Curtis;L. J. Dursi;B. Fryxell;P. MacNeice;K. Olson;P. Ricker;R. Rosner;F. X. Timmes;H. M. Tufo;J. W. Turan;M. Zingale;G. Henry
Affiliations:
Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Drexel University, Philadelphia, PA and NASA Goddard Space Flight Center, Greenbelt, MD;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL and NASA Goddard Space Flight Center, Greenbelt, MD;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Center for Astrophysical Thermonuclear Flashes, The University of Chicago, Chicago, IL;Intel Corporation, Santa Clara, CA
Venue:
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Year:
2000

Citing 3
Cited 15

An adaptive finite element scheme for transient problems in CFD

Computer Methods in Applied Mechanics and Engineering
Local adaptive mesh refinement for shock hydrodynamics

Journal of Computational Physics
A parallel hashed Oct-Tree N-body algorithm

Proceedings of the 1993 ACM/IEEE conference on Supercomputing

Statistical scalability analysis of communication operations in distributed applications

PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
Large scale parallel structured AMR calculations using the SAMRAI framework

Proceedings of the 2001 ACM/IEEE conference on Supercomputing
A case study in application I/O on Linux clusters

Proceedings of the 2001 ACM/IEEE conference on Supercomputing
An Application-Centric Characterization of Domain-Based SFC Partitioners for Parallel SAMR

IEEE Transactions on Parallel and Distributed Systems
An Integrated Performance Visualizer for MPI/OpenMP Programs

WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
Enhancing scalability of parallel structured AMR calculations

ICS '03 Proceedings of the 17th annual international conference on Supercomputing
Validating Astrophysical Simulation Codes

Computing in Science and Engineering
Metrics and models for reordering transformations

MSP '04 Proceedings of the 2004 workshop on Memory system performance
Scalable Parallel Octree Meshing for TeraScale Applications

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Hybrid Runtime Management of Space-Time Heterogeneity for Parallel Structured Adaptive Applications

IEEE Transactions on Parallel and Distributed Systems
Scalable adaptive mantle convection simulation on petascale supercomputers

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Extreme-Scale AMR

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
p4est: Scalable Algorithms for Parallel Adaptive Mesh Refinement on Forests of Octrees

SIAM Journal on Scientific Computing
Performance analysis of Intel multiprocessors using astrophysics simulations

Concurrency and Computation: Practice & Experience
Pragmatic optimizations for better scientific utilization of large supercomputers

International Journal of High Performance Computing Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present simulations and performance results of nuclear burning fronts in supernovae on the largest domain and at the finest spatial resolution studied to date. These simulations were performed on the Intel ASCI-Red machine at Sandia National Laboratories using FLASH, a code developed at the Center for Astrophysical Thermonuclear Flashes at the University of Chicago. FLASH is a modular, adaptive mesh, parallel simulation code capable of handling compressible, reactive fluid flows inastrophysical environments. FLASH is written primarily in Fortran 90, uses the Message-Passing Interface library for inter-processor communication and portability, and employs the PARAMESH package to manage a block-structured adaptive mesh that places blocks only where resolution is required and tracks rapidly changing flow features, such as detonation fronts, with ease. We describe the key algorithms and their implementation as well as the optimizations required to achieve sustained performance of 238 GFLOPS on 6420 processors of ASCI-Red in 64 bit arithmetic.