Inferring arbitrary distributions for data and computation

Authors:
Soham Sundar Chakraborty;V. Krishna Nandivada
Affiliations:
TCS, Pune, India;IBM, Bangalore, India
Venue:
Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion
Year:
2010

Citing 19
Cited 0

Memory storage patterns in parallel processing

Memory storage patterns in parallel processing
Semi-automatic process partitioning for parallel computation

International Journal of Parallel Programming
Process decomposition through locality of reference

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
A denotational semantics of inheritance and its correctness

OOPSLA '89 Conference proceedings on Object-oriented programming systems, languages and applications
Analysis and transformation in the ParaScope editor

ICS '91 Proceedings of the 5th international conference on Supercomputing
PARADIGM: a compiler for automatic data distribution on multicomputers

ICS '93 Proceedings of the 7th international conference on Supercomputing
DDT: a research tool for automatic data distribution in high performance Fortran

Scientific Programming - Special issue: High Performance Fortran comes of age
Efficient Algorithms for Data Distribution on Distributed Memory Parallel Computers

IEEE Transactions on Parallel and Distributed Systems
SUIF Explorer: an interactive and interprocedural parallelizer

Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Reverse execution of programs

ACM SIGPLAN Notices
Compilation techniques for parallel systems

Parallel Computing - Special Anniversary issue
Optimizing compilers for modern architectures: a dependence-based approach

Optimizing compilers for modern architectures: a dependence-based approach
Automatic data and computation decomposition on distributed memory parallel computers

ACM Transactions on Programming Languages and Systems (TOPLAS)
Array Distribution in Data-Parallel Programs

LCPC '94 Proceedings of the 7th International Workshop on Languages and Compilers for Parallel Computing
An Incremental Programming Environment

IEEE Transactions on Software Engineering
The PARSEC benchmark suite: characterization and architectural implications

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
Static Detection of Place Locality and Elimination of Runtime Checks

APLAS '08 Proceedings of the 6th Asian Symposium on Programming Languages and Systems
Towards concurrency refactoring for x10

Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Refactoring sequential Java code for concurrency via concurrent libraries

ICSE '09 Proceedings of the 31st International Conference on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

In the era of mult-core systems, one of the key requirements of achieving better utilization of multiple available cores is that of parallelization of code across multiple distributed nodes; this involves (re)distribution of both data and computation. Such a transformation can be a fairly tedious activity considering the possible dependencies (data, control) and interference between different segments of the code. Further, to keep the data accesses local, computation distribution requires appropriate data distribution and vice versa. And this inter-dependence between distribution of data and computation makes the problem challenging. Another important challenge in this context is that the desired distribution may not be one of the well-known distributions (such as blocked, cyclic etc), and thus reasoning about it can be non-trivial.We present a refactoring framework that can help an application developer to incrementally distribute programs in the context of distributed memory multi-core systems. Given a loop and an array accessed therein, the goal of our framework is to distribute the array based on a specified distribution for the loop (or vice versa) such that the number of remote accessed are reduced. Our framework goes beyond the well-known distributions, and can handle any arbitrary distributions. In our initial investigation, we have used our transformations on varied parallel benchmark programs and have been able to show its applicability along the expected lines.