Existing compilers often fail to parallelize sequential code, even when a program can be manually transformed into parallel form by a sequence of well-understood transformations (as is the case for many of the Perfect Club Benchmark programs). These failures can occur for several reasons: the code transformations implemented in the compiler may not be sufficient to produce parallel code, the compiler may not find the proper sequence of transformations, or the compiler may not be able to prove that one of the necessary transformations is legal.

When a compiler fails to extract sufficient parallelism from a program, the programmer may try to extract additional parallelism. Unfortunately, the programmer is typically left to search for parallelism without significant assistance. The compiler generally does not give feedback about which parts of the program might contain additional parallelism, or about the types of transformations that might be needed to realize this parallelism. Standard program transformations and dependence abstractions cannot be used to provide this feedback.

In this paper, we propose a two-step approach to the search for parallelism in sequential programs. In the first step, we construct several sets of constraints that describe, for each statement, which iterations of that statement can be executed concurrently. By constructing constraints that correspond to different assumptions about which dependences might be eliminated through additional analysis, transformations, and user assertions, we can determine whether parallelism can be exposed by eliminating dependences. In the second step, we examine these constraint sets to identify the kinds of transformations needed to exploit scalable parallelism. Our tests identify conditional parallelism as well as parallelism that can be exposed by combinations of transformations that reorder the iteration space (such as loop interchange and loop peeling).

This approach lets us distinguish inherently sequential code from code that contains unexploited parallelism. It also produces information about the kinds of transformations needed to parallelize the code, without concern for the order in which the transformations are applied. Furthermore, when our dependence test is inexact, we can identify which unresolved dependences inhibit parallelism by comparing the effects of assuming dependence or independence. We are currently exploring the use of this information in programmer-assisted parallelization.
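To make the first step concrete, the sketch below (not the paper's implementation; the toy loop and helper names are hypothetical) enumerates the flow dependences of a small loop and, from them, groups the iterations into sets that can execute concurrently. A real tool would represent the same information symbolically as constraints rather than by enumerating iterations, but the contrast between the two runs at the end mirrors the idea of building constraint sets under different assumptions about which dependences can be eliminated.

```python
# Illustrative sketch only: the toy loop and helper names are hypothetical.
# Loop under analysis (shown as a comment):
#     for i in 1..n:  a[i] = a[i-2] + b[i]
# Iteration r reads a[r-2], which iteration w = r - 2 wrote, so there is a
# flow dependence from iteration r-2 to iteration r (distance 2).

def flow_dependences(n):
    """Return pairs (w, r): iteration r reads the value written by iteration w."""
    return [(r - 2, r) for r in range(1, n + 1) if r - 2 >= 1]

def concurrent_groups(n, deps):
    """Greedy wavefront schedule: place each iteration in the earliest group
    that follows the groups of all iterations it depends on."""
    group_of = {}
    groups = []
    for i in range(1, n + 1):
        g = max((group_of[w] + 1 for (w, r) in deps if r == i), default=0)
        group_of[i] = g
        while len(groups) <= g:
            groups.append([])
        groups[g].append(i)
    return groups

if __name__ == "__main__":
    n = 8
    deps = flow_dependences(n)
    print("flow dependences:", deps)               # [(1, 3), (2, 4), ...]
    print("concurrent groups:", concurrent_groups(n, deps))
    # With the distance-2 dependence: [[1, 2], [3, 4], [5, 6], [7, 8]]
    # If further analysis or a user assertion eliminated the dependence,
    # every iteration would fall into one group, i.e. the loop is fully parallel.
    print("if dependence eliminated:", concurrent_groups(n, []))
```

Comparing the schedule obtained with the dependence against the one obtained without it shows, in miniature, how contrasting constraint sets built under different assumptions can reveal whether eliminating a questionable dependence would expose additional parallelism.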