Journal of Computational Physics
Weighted essentially non-oscillatory schemes
Journal of Computational Physics
Cilk: an efficient multithreaded runtime system
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Efficient implementation of weighted ENO schemes
Journal of Computational Physics
Scheduling multithreaded computations by work stealing
Journal of the ACM (JACM)
A Simple Method for Compressible Multifluid Flows
SIAM Journal on Scientific Computing
Design Patterns: Abstraction and Reuse of Object-Oriented Design
ECOOP '93 Proceedings of the 7th European Conference on Object-Oriented Programming
A flux-split algorithm applied to conservative models for multicomponent compressible flows
Journal of Computational Physics
Impact of modern memory subsystems on cache optimizations for stencil computations
Proceedings of the 2005 workshop on Memory system performance
A multi-phase SPH method for macroscopic and mesoscopic flows
Journal of Computational Physics
Implicit and explicit optimizations for stencil computations
Proceedings of the 2006 workshop on Memory system performance and correctness
Implementation of WENO schemes in compressible multicomponent flow problems
Journal of Computational Physics
A conservative interface method for compressible flows
Journal of Computational Physics
An adaptive mesh refinement benchmark for modern parallel programming languages
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Data and thread affinity in openmp programs
Proceedings of the 2008 workshop on Memory access on future processors: a solved problem?
IBM Journal of Research and Development
Roofline: an insightful visual performance model for multicore architectures
Communications of the ACM - A Direct Path to Dependable Software
Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective
IWOMP '09 Proceedings of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism
Enabling high-performance memory migration for multithreaded applications on LINUX
IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Uintah: a scalable framework for hazard analysis
Proceedings of the 2010 TeraGrid Conference
An interface capturing method for the simulation of multi-phase compressible flows
Journal of Computational Physics
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Importance of explicit vectorization for CPU and GPU software performance
Journal of Computational Physics
Using hybrid parallelism to improve memory use in the Uintah framework
Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery
Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Petaflop biofluidics simulations on a two million-core system
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Hardware/software co-design for energy-efficient seismic modeling
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
SIAM Journal on Scientific Computing
GPU and APU computations of Finite Time Lyapunov Exponent fields
Journal of Computational Physics
Anti-diffusion interface sharpening technique for two-phase compressible flow simulations
Journal of Computational Physics
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
11 PFLOP/s simulations of cloud cavitation collapse
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
We present an open source, object-oriented software for high throughput Direct Numerical Simulations of compressible, two-phase flows. The Navier-Stokes equations are discretized on uniform grids using high order finite volume methods. The software exploits recent CPU micro-architectures by explicit vectorization and adopts NUMA-aware techniques as well as data and computation reordering. We report a compressible flow solver with unprecedented fractions of peak performance: 45% of the peak for a single node (nominal performance of 840 GFLOP/s) and 30% for a cluster of 47'000 cores (nominal performance of 0.8 PFLOP/s). We suggest that the present work may serve as a performance upper bound, regarding achievable GFLOP/s, for two-phase flow solvers using adaptive mesh refinement. The software enables 3D simulations of shock-bubble interaction including, for the first time, effects of diffusion and surface tension, by efficiently employing two hundred billion computational elements.