Application of the ParalleX execution model to stencil-based problems

Authors:
T. Heller; H. Kaiser; K. Iglberger
Affiliations:
Friedrich-Alexander University Erlangen-Nuremberg, Erlangen, Germany 91058; Center for Computation and Technology, Louisiana State University, Baton Rouge, USA 70803; Friedrich-Alexander University Erlangen-Nuremberg, Erlangen, Germany 91058
Venue:
Computer Science - Research and Development
Year:
2013

Abstract

With the exascale era and its millions of execution units approaching, the question of how to handle this level of parallelism efficiently has become urgent. State-of-the-art parallelization techniques such as OpenMP and MPI are not guaranteed to solve the expected problems of starvation, growing latencies, overheads, and contention. New parallelization paradigms, on the other hand, promise to hide latencies efficiently and to contain starvation and contention.

In this paper we analyze the performance of one novel parallelization strategy for shared and distributed memory machines. We focus on shared memory architectures and compare the performance of the ParalleX execution model against the quasi-standard OpenMP for a standard stencil-based problem. We compare in detail the OpenMP implementations of two Jacobi solver applications (one based on a regular grid and one based on an irregular grid structure) with the corresponding implementations using HPX (High Performance ParalleX), the first feature-complete, open-source implementation of ParalleX, and analyze the results of both implementations on a multi-socket NUMA node.
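
To make the stencil problem concrete, the following is a minimal sketch of a Jacobi iteration on a regular 2D grid, parallelized with an OpenMP worksharing loop. It is not the authors' code. The function names `jacobi_sweep` and `jacobi_solve`, the row-major layout, the four-point averaging stencil, and the fixed iteration count are all assumptions made for illustration. The HPX variant and the irregular-grid case discussed in the paper are not shown.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper: one Jacobi sweep on a regular nx-by-ny grid stored
// row-major. Each interior point becomes the average of its four neighbors;
// boundary points are never written.
void jacobi_sweep(const std::vector<double>& src, std::vector<double>& dst,
                  std::size_t nx, std::size_t ny)
{
    assert(src.size() == nx * ny && dst.size() == nx * ny);

    // Rows are independent within one sweep because the sweep only reads
    // src and only writes dst. The pragma is ignored by compilers that are
    // not invoked with OpenMP support, leaving a correct serial loop.
    #pragma omp parallel for
    for (long y = 1; y < static_cast<long>(ny) - 1; ++y) {
        for (std::size_t x = 1; x + 1 < nx; ++x) {
            const std::size_t i = static_cast<std::size_t>(y) * nx + x;
            dst[i] = 0.25 * (src[i - 1] + src[i + 1] + src[i - nx] + src[i + nx]);
        }
    }
}

// Hypothetical driver: run a fixed number of sweeps, swapping the two
// buffers after each one, and return the final grid.
std::vector<double> jacobi_solve(std::vector<double> u, std::size_t nx,
                                 std::size_t ny, int iterations)
{
    std::vector<double> next = u;  // copy so both buffers hold the boundary values
    for (int it = 0; it < iterations; ++it) {
        jacobi_sweep(u, next, nx, ny);
        std::swap(u, next);
    }
    return u;
}
```

Calling `jacobi_solve(grid, nx, ny, k)` with the boundary values already stored in `grid` returns the grid after `k` sweeps. The implicit barrier at the end of each parallel loop is the kind of global synchronization point that a futures-based runtime such as HPX aims to relax.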