High Resolution Aerospace Applications using the NASA Columbia Supercomputer

Authors:
Dimitri J. Mavriplis;Michael J. Aftosmis;Marsha Berger
Affiliations:
University of Wyoming, Laramie;NASA Ames Research Center, Moffett Field, CA;Courant Institute, New York University
Venue:
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Year:
2005

Citing 6
Cited 10

Using MPI: portable parallel programming with the message-passing interface

Using MPI: portable parallel programming with the message-passing interface
Multigrid strategies for viscous flow solvers on anisotropic unstructured meshes

Journal of Computational Physics
Parallel performance investigations of an unstructured mesh Navier-Stokes solver

Parallel performance investigations of an unstructured mesh Navier-Stokes solver
Performance of a new CFD flow solver using a hybrid programming paradigm

Journal of Parallel and Distributed Computing
An Application-Based Performance Characterization of the Columbia Supercluster

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Digital Flight: The Last CFD Aeronautical Grand Challenge

Journal of Scientific Computing

Mapping with Space Filling Surfaces

IEEE Transactions on Parallel and Distributed Systems
Low-constant parallel algorithms for finite element simulations using linear octrees

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Scientific application-based performance comparison of SGI Altix 4700, IBM POWER5+, and SGI ICE 8200 supercomputers

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Dendro: parallel algorithms for multigrid and AMR methods on 2:1 balanced octrees

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Scalable adaptive mantle convection simulation on petascale supercomputers

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
A Parallel Geometric Multigrid Method for Finite Elements on Octree Meshes

SIAM Journal on Scientific Computing
Impact of the columbia supercomputer on NASA science and engineering applications

IWDC'05 Proceedings of the 7th international conference on Distributed Computing
Parallel geometric-algebraic multigrid on unstructured forests of octrees

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
An early performance evaluation of many integrated core architecture based SGI rackable computing system

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper focuses on the parallel performance of two high-performance aerodynamic simulation packages on the newly installed NASA Columbia supercomputer. These packages include both a high-fidelity, unstructured, Reynolds-averaged Navier-Stokes solver, and a fully-automated inviscid flow package for cut-cell Cartesian grids. The complementary combination of these two simulation codes enables high-fidelity characterization of aerospace vehicle design performance over the entire flight envelope through extensive parametric analysis and detailed simulation of critical regions of the flight envelope. Both packages are industrial-level codes designed for complex geometry and incorporate customized multigrid solution algorithms. The performance of these codes on Columbia is examined using both MPI and OpenMP and using both the NUMAlink and InfiniBand interconnect fabrics. Numerical results demonstrate good scalability on up to 2016 cpus using the NUMAlink4 interconnect, with measured computational rates in the vicinity of 3 TFLOP/s, while InfiniBand showed some performance degradation at high CPU counts, particularly with multigrid. Nonetheless, the results are encouraging enough to indicate that larger test cases using combined MPI/OpenMP communication should scale well on even more processors.