Analyses for the translation of OpenMP codes into SPMD style with array privatization

Authors:
Zhenying Liu;Barbara Chapman;Yi Wen;Lei Huang;Tien-Hsiung Weng;Oscar Hernandez
Affiliations:
Department of Computer Science, University of Houston;Department of Computer Science, University of Houston;Department of Computer Science, University of Houston;Department of Computer Science, University of Houston;Department of Computer Science, University of Houston;Department of Computer Science, University of Houston
Venue:
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
Year:
2003

Citing 15
Cited 5

Direct parallelization of call statements

SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
Analysis of interprocedural side effects in a parallel programming environment

Journal of Parallel and Distributed Computing - Special Issue on Languages, Compilers and environments for Parallel Programming
A technique for summarizing data access and its use in parallelism enhancing transformations

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Efficient call graph analysis

ACM Letters on Programming Languages and Systems (LOPLAS)
PARADIGM: a compiler for automatic data distribution on multicomputers

ICS '93 Proceedings of the 7th international conference on Supercomputing
Automatic data layout for high performance Fortran

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Data distribution support on distributed shared memory multiprocessors

Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
Is data distribution necessary in OpenMP?

Proceedings of the 2000 ACM/IEEE conference on Supercomputing
An Advanced Compiler Framework for Non-Cache-Coherent Multiprocessors

IEEE Transactions on Parallel and Distributed Systems
An Implementation of Interprocedural Bounded Regular Section Analysis

IEEE Transactions on Parallel and Distributed Systems
Complex Pipelined Executions in OpenMP Parallel Applications

ICPP '02 Proceedings of the 2001 International Conference on Parallel Processing
Interprocedural Array Alignment Analysis

HPCN Europe 1998 Proceedings of the International Conference and Exhibition on High-Performance Computing and Networking
Extending OpenMP for NUMA machines

Scientific Programming
Constructing the Call Graph of a Program

IEEE Transactions on Software Engineering
Improving the performance of OpenMP by array privatization

WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming

Towards optimisation of openMP codes for synchronisation and data reuse

International Journal of High Performance Computing and Networking
Toward enhancing OpenMP's work-sharing directives

Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
A tool to display array access patterns in OpenMP programs

PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Performance of parallel bit-reversal with cilk and UPC for fast fourier transform

GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing
Strategies and implementation for translating OpenMP code for clusters

HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications

Quantified Score

Hi-index	0.00

Visualization

Abstract

A so-called SPMD style OpenMP program can achieve scalability on ccNUMA systems by means of array privatization, and earlier research has shown good performance under this approach. Since it is hard to write SPMD OpenMP code, we showed a strategy for the automatic translation of many OpenMP constructs into SPMD style in our previous work. In this paper, we first explain how to interprocedurally detect whether the OpenMP program consistently schedules the parallel loops. If the parallel loops are consistently scheduled, we may carry out array privatization according to OpenMP semantics. We give two examples of code patterns that can be handled despite the fact that they are not consistent, and where the strategy used to translate them differs from the straightforward approach that can otherwise be applied.