Parallel multilevel k-way partitioning scheme for irregular graphs
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Theory of Self-Reproducing Automata
Theory of Self-Reproducing Automata
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
LibGeoDecomp: A Grid-Enabled Library for Geometric Decomposition Codes
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
In this paper we present our findings from parallelizing a material science application which simulates dendritic growth in molten metal alloys. The simulation itself is based on an iterative 2D meshfree model. The simulation cells are tightly coupled and depend on neighbors in a relatively large radius, so the code turned out to be communication bound. We present two different approaches for the parallelization: one specifically written for this application, and one which uses LibGeoDecomp, a stencil code library. Benchmarks show that the stencil code library performs much better than expected, despite not being designed for this use case.