Preliminary design examination of the ParalleX system from a software and hardware perspective

Authors:
Alexandre Tabbal;Matthew Anderson;Maciej Brodowicz;Hartmut Kaiser;Thomas Sterling
Affiliations:
LSU, Baton Rouge, LA, USA;LSU, Baton Rouge, LA, USA;LSU, Baton Rouge, LA, USA;LSU, Baton Rouge, LA, USA;LSU, Baton Rouge, LA, USA
Venue:
ACM SIGMETRICS Performance Evaluation Review - Special issue on the 1st international workshop on performance modeling, benchmarking and simulation of high performance computing systems (PMBS 10)
Year:
2011

Citing 6
Cited 2

Monsoon: an explicit token-store architecture

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
The architecture of the DIVA processing-in-memory chip

ICS '02 Proceedings of the 16th international conference on Supercomputing
Processing in Memory: The Terasys Massively Parallel PIM Array

Computer
GASNet Specification, v1.1

GASNet Specification, v1.1
Improving code compression using clustered modalities

Proceedings of the 46th Annual Southeast Regional Conference on XX
ParalleX An Advanced Parallel Execution Model for Scaling-Impaired Applications

ICPPW '09 Proceedings of the 2009 International Conference on Parallel Processing Workshops

Improving the scalability of parallel N-body applications with an event-driven constraint-based execution model

International Journal of High Performance Computing Applications
Using HPX and LibGeoDecomp for scaling HPC applications on heterogeneous supercomputers

ScalA '13 Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Exascale systems, expected to emerge by the end of the next decade, will require the exploitation of billion-way parallelism at multiple hierarchical levels in order to achieve the desired sustained performance. While traditional approaches to performance evaluation involve measurements of existing applications on the available platforms, such a methodology is obviously unsuitable for architectures still at the brainstorming stage. The prediction of the future machine performance is an important factor driving the design of both the execution hardware and software environment. A good way to start assessing the performance is to identify the factors challenging the scalability of parallel applications. We believe the root cause of these challenges is the incoherent coupling between the current enabling technologies, such as Non-Uniform Memory Access of present multicore nodes equipped with optional hardware accelerators and the decades older execution model, i.e., Communicating Sequential Processes (CSP). Supercomputing is in the midst of a much needed phase change and the High-Performance Computing community is slowly realizing the necessity for a new design dogma, as affirmed in the preliminary Exascale studies. In this paper, we present an overview of the ParalleX execution model and its complementary design efforts at the software and hardware levels, while including power draw of the system as the resource of utmost importance. Since the interplay of hardware and software environment is quickly becoming one of the dominant factors in the design of well integrated, energy efficient, large-scale systems, we also explore the implications of the ParalleX model on the organization of parallel computing architectures. We also present scaling and performance results for an adaptive mesh refinement application developed using a ParalleX-compliant runtime system implementation, HPX.