We present simulations and performance results of nuclear burning fronts in supernovae on the largest domain and at the finest spatial resolution studied to date. These simulations were performed on the Intel ASCI-Red machine at Sandia National Laboratories using FLASH, a code developed at the Center for Astrophysical Thermonuclear Flashes at the University of Chicago. FLASH is a modular, adaptive-mesh, parallel simulation code capable of handling compressible, reactive fluid flows in astrophysical environments. FLASH is written primarily in Fortran 90, uses the Message Passing Interface (MPI) library for inter-processor communication and portability, and employs the PARAMESH package to manage a block-structured adaptive mesh. The mesh places blocks only where resolution is required and readily tracks rapidly changing flow features such as detonation fronts. We describe the key algorithms and their implementation, as well as the optimizations required to achieve a sustained performance of 238 GFLOPS on 6420 processors of ASCI-Red in 64-bit arithmetic.