Eliminating redundancies in sum-of-product array computations

Authors:
Steven J. Deitz;Bradford L. Chamberlain;Lawrence Snyder
Affiliations:
University of Washington, Dept of CSE/ Box 352350, Seattle, WA;University of Washington, Dept of CSE/ Box 352350, Seattle, WA;University of Washington, Dept of CSE/ Box 352350, Seattle, WA
Venue:
ICS '01 Proceedings of the 15th international conference on Supercomputing
Year:
2001

Citing 12
Cited 4

Improving register allocation for subscripted variables

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Fortran at ten gigaflops: the connection machine convolution compiler

PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
Scalar replacement in the presence of conditional control flow

Software—Practice & Experience
The implementation and evaluation of fusion and contraction in array languages

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Code Generation for Expressions with Common Subexpressions

Journal of the ACM (JACM)
A comparative study of the NAS MG benchmark across parallel languages and architectures

Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Compiling stencils in high performance Fortran

SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
FORTRAN 95 Handbook

FORTRAN 95 Handbook
Computer and Robot Vision

Computer and Robot Vision
Loop optimization for aggregate array computations

ICCL '98 Proceedings of the 1998 International Conference on Computer Languages
Serializing Parallel Programs by Removing Redundant Computation

Serializing Parallel Programs by Removing Redundant Computation
Handbook of Mathematical Functions, With Formulas, Graphs, and Mathematical Tables,

Handbook of Mathematical Functions, With Formulas, Graphs, and Mathematical Tables,

The design and development of ZPL

Proceedings of the third ACM SIGPLAN conference on History of programming languages
Redundancy elimination revisited

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs

Proceedings of the 23rd international conference on Supercomputing
Using machine learning to improve automatic vectorization

ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers

Quantified Score

Hi-index	0.00

Visualization

Abstract

Array programming languages such as Fortran 90, High Performance Fortran and ZPL are well-suited to scientific computing because they free the scientist from the responsibility of managing burdensome low-level details that complicate programming in languages like C and Fortran 77. However, these burdensome details are critical to performance, thus necessitating aggressive compilation techniques for their optimization. In this paper, we present a new compiler optimization called Array Subexpression Elimination (ASE) that lets a programmer take advantage of the expressibility afforded by array languages and achieve enviable portability and performance. We design a set of micro-benchmarks that model an important class of computations known as stencils and we report on our implementation of this optimization in the context of this micro-benchmark suite. Our results include a 125% improvement on one of these benchmarks and a 50% average speedup across the suite. Also we show a speedup of 32% improvement on the ZPL port of the NAS MG Parallel Benchmark and a 29% speedup over the hand-optimized Fortran version. Further, the compilation time is only negligibly affected.