GIVE-N-TAKE—a balanced code placement framework

Authors:
Reinhard von Hanxleden;Ken Kennedy
Affiliations:
Rice Univ., Houston, TX;Rice Univ., Houston, TX
Venue:
PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Year:
1994

Citing 24
Cited 35

Elimination algorithms for data flow analysis

ACM Computing Surveys (CSUR)
A fast algorithm for code movement optimisation

ACM SIGPLAN Notices
Register assignment using code placement techniques

Computer Languages
Structured dataflow analysis for arrays and its use in an optimizing complier

Software—Practice & Experience
An interval-based approach to exhaustive and incremental interprocedural data-flow analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
Properties of data flow frameworks: a unified model

Acta Informatica
Detecting redundant accesses to array data

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Computer support for machine-independent parallel programming in Fortran D

Languages, compilers and run-time environments for distributed memory machines
How to analyze large programs efficiently and informatively

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Lazy code motion

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Evaluation of compiler optimizations for Fortran D on MIMD distributed memory machines

ICS '92 Proceedings of the 6th international conference on Supercomputing
A practical data flow framework for array reference analysis and its use in optimizations

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Communication optimization and code generation for distributed memory machines

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
The high performance Fortran handbook

The high performance Fortran handbook
An elimination algorithm for bidirectional data flow problems using edge placement

ACM Transactions on Programming Languages and Systems (TOPLAS)
A solution to a problem with Morel and Renvoise's “Global optimization by suppression of partial redundancies”

ACM Transactions on Programming Languages and Systems (TOPLAS)
Some comments on “A solution to a problem with Morel and Renvoise's 'Global optimization by suppression of partial redundancies'”

ACM Transactions on Programming Languages and Systems (TOPLAS)
Practical adaption of the global optimization algorithm of Morel and Renvoise

ACM Transactions on Programming Languages and Systems (TOPLAS)
A Fast and Usually Linear Algorithm for Global Flow Analysis

Journal of the ACM (JACM)
Global optimization by suppression of partial redundancies

Communications of the ACM
Compiler Analysis for Irregular Problems in Fortran D

Proceedings of the 5th International Workshop on Languages and Compilers for Parallel Computing
A Framework for Exploiting Data Availability to Opimize Communication

Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
Control flow analysis

Proceedings of a symposium on Compiler optimization
Global common subexpression elimination

Proceedings of a symposium on Compiler optimization

Interprocedural partial redundancy elimination and its application to distributed memory compilation

PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
Interprocedural compilation of irregular applications for distributed memory machines

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
An HPF compiler for the IBM SP2

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Global communication analysis and optimization

PLDI '96 Proceedings of the ACM SIGPLAN 1996 conference on Programming language design and implementation
An interprocedural framework for placement of asynchronous I/O operations

ICS '96 Proceedings of the 10th international conference on Supercomputing
A Unified Framework for Optimizing Communication in Data-Parallel Programs

IEEE Transactions on Parallel and Distributed Systems
Compiler and software distributed shared memory support for irregular applications

PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Communication optimizations for parallel C programs

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
High-level management of communication schedules in HPF-like languages

ICS '98 Proceedings of the 12th international conference on Supercomputing
Problem and machine sensitive communication optimization

ICS '98 Proceedings of the 12th international conference on Supercomputing
Interprocedural Partial Redundancy Elimination With Application to Distributed Memory Compilation

IEEE Transactions on Parallel and Distributed Systems
A General Interprocedural Framework for Placement of Split-Phase Large Latency Operations

IEEE Transactions on Parallel and Distributed Systems
A global communication optimization technique based on data-flow analysis and linear algebra

ACM Transactions on Programming Languages and Systems (TOPLAS)
Compiler and Run-Time Support for Exploiting Regularity within Irregular Applications

IEEE Transactions on Parallel and Distributed Systems
A Loop Transformation Algorithm for Communication Overlapping

International Journal of Parallel Programming - Special issue on international symposium on high performance computing 1997, part I
Data Locality Exploitation in the Decomposition of Regular Domain Problems

IEEE Transactions on Parallel and Distributed Systems
Minimizing Data and Synchronization Costs in One-Way Communication

IEEE Transactions on Parallel and Distributed Systems
Global optimization techniques for automatic parallelization of hybrid applications

ICS '01 Proceedings of the 15th international conference on Supercomputing
Static Single Assignment Form for Message-Passing Programs

International Journal of Parallel Programming
Contention elimination by replication of sequential sections in distributed shared memory programs

PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
Compiler optimization of dynamic data distributions for distributed-memory multicomputers

Compiler optimizations for scalable parallel systems
A framework for global communication analysis of optimizations

Compiler optimizations for scalable parallel systems
Bidirectional data flow analysis: myths and reality

ACM SIGPLAN Notices
Compiling Several Classes of Communication Patterns on a Multithreaded Architecture

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Comparison of Parallelization Techniques for Irregular Reductions

IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Improving Compiler and Run-Time Support for Irregular Reductions Using Local Writes

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Improving Locality for Adaptive Irregular Scientific Codes

LCPC '00 Proceedings of the 13th International Workshop on Languages and Compilers for Parallel Computing-Revised Papers
A Comparison of Locality Transformations for Irregular Codes

LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Accurate Data and Context Management in Message-Passing Programs

LCPC '99 Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing
Aggressive communication optimizations for clusters of workstations

Cluster computing
Instruction scheduling for a tiled dataflow architecture

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Interprocedural definition-use chains of dynamic pointer-linked data structures

Scientific Programming
Runtime characterisation of irregular accesses applied to parallelisation of irregular reductions

International Journal of Computational Science and Engineering
Optimizing irregular shared-memory applications for clusters

Proceedings of the 22nd annual international conference on Supercomputing
Bidirectional data flow analysis for type inferencing

Computer Languages, Systems and Structures

Quantified Score

Hi-index	0.00

Visualization

Abstract

GIVE-N-TAKE is a code placement framework which uses a general producer-consumer concept. An advantage of GIVE-N-TAKE over existing partial redundancy elimination techniques is its concept of production regions, instead of single locations, which can be beneficial for general latency hiding. GIVE-N-TAKE guaranteed balanced production, that is, each production will be started and stopped once. The framework can also take advantage of production coming “for free,” as induced by side effects, without disturbing balance. GIVE-N-TAKE can place production either before or after consumption, and it also provides the option to hoist code out of potentially zero-trip loop (nest) constructs. GIVE-N-TAKE uses a fast elimination method based on Tarjan intervals, with a complexity linear in the program size in most cases.We have implemented GIVE-N-TAKE as part of a Fortran D compiler prototype, where it solves various communication generation problems associated with compiling data-parallel languages onto distributed-memory architectures.