The input/output complexity of sorting and related problems
Communications of the ACM
Time/space tradeoffs for polygon mesh rendering
ACM Transactions on Graphics (TOG)
Large scale terrain visualization using the restricted quadtree triangulation
Proceedings of the conference on Visualization '98
Optimization of mesh locality for transparent vertex caching
Proceedings of the 26th annual conference on Computer graphics and interactive techniques
Cache oblivious stencil computations
Proceedings of the 19th annual international conference on Supercomputing
A refinement-tree based partitioning method for dynamic load balancing with adaptively refined grids
Journal of Parallel and Distributed Computing
International Journal of Computational Science and Engineering
SIAM Journal on Scientific Computing
Cluster optimization and parallelization of simulations with dynamically adaptive grids
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Adaptive mesh refinement and iterative traversals of unknowns on such adaptive grids are fundamental building blocks for PDE solvers. We discuss a respective integrated approach for grid refinement and processing of unknowns that is based on recursively structured triangular grids and space-filling element orders. In earlier work, the approach was demonstrated to be highly memory- and cache-efficient. In this paper, we analyse the cache efficiency of the traversal algorithms using the I/O model. Further, we discuss how the nested recursive traversal algorithms can be efficiently implemented. For that purpose, we compare the memory throughput of respective implementations with simple stream benchmarks, and study the dependence of memory throughput and floating point performance from the computational load per element.