On Privatization of Variables for Data-Parallel Execution

Authors:
Manish Gupta
Affiliations:
-
Venue:
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Year:
1997

Citing 15
Cited 8

Advanced compiler optimizations for supercomputers

Communications of the ACM - Special issue on parallelism
Array expansion

ICS '88 Proceedings of the 2nd international conference on Supercomputing
Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Compiling Fortran D for MIMD distributed-memory machines

Communications of the ACM
The high performance Fortran handbook

The high performance Fortran handbook
Mobile and replicated alignment of arrays in data-parallel programs

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Symbolic array dataflow analysis for array privatization and program parallelization

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Detecting coarse-grain parallelism using an interprocedural parallelizing compiler

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
An HPF compiler for the IBM SP2

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
The Paradigm Compiler for Distributed-Memory Multicomputers

Computer
Compiling Global Name-Space Parallel Loops for Distributed Execution

IEEE Transactions on Parallel and Distributed Systems
Experience in the Automatic Parallelization of Four Perfect-Benchmark Programs

Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing
Automatic Array Privatization

Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
A Compilation Approach for Fortran 90D/ HPF Compilers

Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
(R) Compiler Support for Privatization on Distributed - Memory Machines

ICPP '96 Proceedings of the Proceedings of the 1996 International Conference on Parallel Processing - Volume 3

Automatic data and computation decomposition on distributed memory parallel computers

ACM Transactions on Programming Languages and Systems (TOPLAS)
MARS: A Distributed Memory Approach to Shared Memory Compilation

LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Automatic privatization for parallel execution of loops

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
Execution privatization for scheduler-oblivious concurrent programs

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Improved loop tiling based on the removal of spurious false dependences

ACM Transactions on Architecture and Code Optimization (TACO) - Special Issue on High-Performance Embedded Architectures and Compilers
Programming with relaxed synchronization

Proceedings of the 2012 ACM workshop on Relaxing synchronization for multicore and manycore scalability
General data structure expansion for multi-threading

Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation
Optimising space exploration of OpenCL for GPGPUs

International Journal of Computational Science and Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Privatization of data is an important technique that has been used bycompilers to parallelize loops by eliminating storage-related dependences. When a compiler partitions computations based on the ownership of data, selecting a proper mapping of privatizable data is cru- cial to obtaining the bene.ts of privatization. This pa- per presents a novel framework for privatizing scalar and array variables in the context of a data-driven ap- proach to parallelization. We show that there are nu- merous alternatives available for mapping privatized variables and the choice of mapping can signi.cantly a.ect the performance of the program. We present an algorithm that attempts to preserve parallelism and minimize communication overheads. We also introduce the concept of partial privatization of arrays that combines data partitioning and privatization, and enables e.cient handling of a class of codes with multi-dimensional data distribution that was not previously possible. Finally, we show how the ideas of privatization apply to the execution of control ow statements as well. An implementation of these ideas in the pHPF prototype compiler for High Performance Fortran on the IBM SP2 machine has shown impressive results.