A balanced code placement framework

Authors:
Reinhard von Hanxleden;Ken Kennedy
Affiliations:
DaimlerChrysler AG, Berlin, Germany;Rice Univ., Houston, TX
Venue:
ACM Transactions on Programming Languages and Systems (TOPLAS)
Year:
2000

Citing 38
Cited 2

Elimination algorithms for data flow analysis

ACM Computing Surveys (CSUR)
A fast algorithm for code movement optimisation

ACM SIGPLAN Notices
Register assignment using code placement techniques

Computer Languages
Global value numbers and redundant computations

POPL '88 Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Structured dataflow analysis for arrays and its use in an optimizing complier

Software—Practice & Experience
An interval-based approach to exhaustive and incremental interprocedural data-flow analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
Introduction to algorithms

Introduction to algorithms
Properties of data flow frameworks: a unified model

Acta Informatica
Detecting redundant accesses to array data

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Computer support for machine-independent parallel programming in Fortran D

Languages, compilers and run-time environments for distributed memory machines
How to analyze large programs efficiently and informatively

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Evaluation of compiler optimizations for Fortran D on MIMD distributed memory machines

ICS '92 Proceedings of the 6th international conference on Supercomputing
A practical data flow framework for array reference analysis and its use in optimizations

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Communication optimization and code generation for distributed memory machines

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Complexity of bi-directional data flow analysis

POPL '93 Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
The high performance Fortran handbook

The high performance Fortran handbook
Automatic data partitioning on distributed memory multicomputers

Automatic data partitioning on distributed memory multicomputers
An elimination algorithm for bidirectional data flow problems using edge placement

ACM Transactions on Programming Languages and Systems (TOPLAS)
Scalar replacement in the presence of conditional control flow

Software—Practice & Experience
Optimal code motion: theory and practice

ACM Transactions on Programming Languages and Systems (TOPLAS)
Interprocedural partial redundancy elimination and its application to distributed memory compilation

PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
A solution to a problem with Morel and Renvoise's “Global optimization by suppression of partial redundancies”

ACM Transactions on Programming Languages and Systems (TOPLAS)
Some comments on “A solution to a problem with Morel and Renvoise's 'Global optimization by suppression of partial redundancies'”

ACM Transactions on Programming Languages and Systems (TOPLAS)
Practical adaption of the global optimization algorithm of Morel and Renvoise

ACM Transactions on Programming Languages and Systems (TOPLAS)
Combining flow and dependence analyses to expose redundant array accesses

International Journal of Parallel Programming
A Unified Framework for Optimizing Communication in Data-Parallel Programs

IEEE Transactions on Parallel and Distributed Systems
Compiler support for machine-independent parallelization of irregular problems

Compiler support for machine-independent parallelization of irregular problems
Making graphs reducible with controlled node splitting

ACM Transactions on Programming Languages and Systems (TOPLAS)
Complete removal of redundant expressions

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Register promotion by sparse partial redundancy elimination of loads and stores

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Cost-optimal code motion

ACM Transactions on Programming Languages and Systems (TOPLAS)
Global optimization by suppression of partial redundancies

Communications of the ACM
Compiler Analysis for Irregular Problems in Fortran D

Proceedings of the 5th International Workshop on Languages and Compilers for Parallel Computing
A Framework for Exploiting Data Availability to Opimize Communication

Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
Optimal Distribution Assignment Placement

Euro-Par '97 Proceedings of the Third International Euro-Par Conference on Parallel Processing
Control flow analysis

Proceedings of a symposium on Compiler optimization
Global common subexpression elimination

Proceedings of a symposium on Compiler optimization
Path Profile Guided Partial Redundancy Elimination Using Speculation

ICCL '98 Proceedings of the 1998 International Conference on Computer Languages

Reducing NoC energy consumption through compiler-directed channel voltage scaling

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Delta Send-Recv for Dynamic Pipelining in MPI Programs

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Give-N-Take is a code placement framework which uses a generic producer-consumer mechanism. An instance of this could be a communication step between a processor that computes (produces) some data, and other processors that subsequently reference (consume) these data in an expression. An advantage of Give-N-Take over traditional partial redundancy elimination techniques is its concept of production regions, instead of single locations, which can be beneficial for general latency hiding. Give-N-Take also guarantees balanced production, i.e., each production will be started and stopped exactly once. The framework can also take advantage of production coming “for free,” as induced by side effects, without disturbing balance. Give-N-Take can place production either before or after consumption, and it also provides the option to speculatively hoist code out of potentially zero-trip loop (nest) constructs. Give-N-Take uses a fast elimination method based on Tarjan intervals, with a complexity linear in the program size in most cases. We have implemented Give-N-Take as partof a Fortran D compiler prototype, where it solves various communication generation problems associated with compiling data-parallel languages onto distributed-memory architectures.