High-performacne parallel implicit CFD
Parallel Computing - Special issue on parallel computing in aerospace
Recursion Unrolling for Divide and Conquer Programs
LCPC '00 Proceedings of the 13th International Workshop on Languages and Compilers for Parallel Computing-Revised Papers
The Genesis of Attribute Grammars
Proceedings of the International Conference WAGA on Attribute Grammars and their Applications
Uintah: A Massively Parallel Problem Solving Environment
HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
A Cache-Aware Algorithm for PDEs on Hierarchical Data Structures Based on Space-Filling Curves
SIAM Journal on Scientific Computing
Hierarchical hybrid grids: achieving TERAFLOP performance on large scale finite element simulations
International Journal of Parallel, Emergent and Distributed Systems
Numerical Simulation of Particle Transport in a Drift Ratchet
SIAM Journal on Scientific Computing
Towards multi-phase flow simulations in the PDE framework Peano
Computational Mechanics
Autotuning of adaptive mesh refinement PDE solvers on shared memory architectures
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Cluster optimization and parallelization of simulations with dynamically adaptive grids
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
This paper analyses a PDE solver working on adaptive Cartesian grids. While a rigorous element-wise formulation of this solver offers great flexibility concerning dynamic adaptivity, and while it comes along with very low memory requirements, the realisation's speed can not cope with codes working on patches of regular grids--in particular, if the latter deploy patches to several cores. Instead of composing a grid of regular patches, we suggest to identify regular patches throughout the recursive, element-wise grid traversal. Our code then unrolls the recursion for these regular grid blocks automatically, and it deploys their computations to several cores. It hence benefits from multicores on regular subdomains, but preserves its simple, element-wise character and its ability to handle arbitrary dynamic refinement and domain topology changes.