Compiling C for vectorization, parallelization, and inline expansion

Authors:
R. Allen;S. Johnson
Affiliations:
Ardent Computer Corporation, Sunnyvale, CA;Ardent Computer Corporation, Sunnyvale, CA
Venue:
PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Year:
1988

Citing 11
Cited 29

Efficient instruction scheduling for a pipelined architecture

SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
A vectorizing Fortran compiler

IBM Journal of Research and Development
Programming environments for supercomputers

Supercomputers: algorithms, architectures, and scientific computation
Constant propagation with conditional branches

POPL '85 Proceedings of the 12th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
Global optimization by suppression of partial redundancies

Communications of the ACM
The parallel execution of DO loops

Communications of the ACM
A portable compiler: theory and practice

POPL '78 Proceedings of the 5th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
Conversion of control dependence to data dependence

POPL '83 Proceedings of the 10th ACM SIGACT-SIGPLAN symposium on Principles of programming languages
Dependence graphs and compiler optimizations

POPL '81 Proceedings of the 8th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Dependence analysis for subscripted variables and its application to program transformations

Dependence analysis for subscripted variables and its application to program transformations
Optimizing supercompilers for supercomputers

Optimizing supercompilers for supercomputers

Inline function expansion for compiling C programs

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
An efficient method of computing static single assignment form

POPL '89 Proceedings of the 16th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Pointer target tracking—an empirical study

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Interprocedural transformations for parallel code generation

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Vectorizing C compilers: how good are they?

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Subprogram Inlining: A Study of its Effects on Program Execution Time

IEEE Transactions on Software Engineering
Inlining semantics for subroutines which are recursive

ACM SIGPLAN Notices
Towards better inlining decisions using inlining trials

LFP '94 Proceedings of the 1994 ACM conference on LISP and functional programming
Obtaining sequential efficiency for concurrent object-oriented languages

POPL '95 Proceedings of the 22nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Region-based compilation: an introduction and motivation

Proceedings of the 28th annual international symposium on Microarchitecture
High-level microprogramming: an optimizing C compiler for a processing element of a CAD accelerator

MICRO 23 Proceedings of the 23rd annual workshop and symposium on Microprogramming and microarchitecture
Aggressive inlining

Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
Control flow optimization for supercomputer scalar processing

ICS '89 Proceedings of the 3rd international conference on Supercomputing
Modular interprocedural pointer analysis using access paths: design, implementation, and evaluation

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
An empirical evaluation of high level transformations for embedded processors

CASES '01 Proceedings of the 2001 international conference on Compilers, architecture, and synthesis for embedded systems
The Effect of Code Expanding Optimizations on Instruction Cache Design

IEEE Transactions on Computers
Array recovery and high-level transformations for DSP applications

ACM Transactions on Embedded Computing Systems (TECS)
Parameter-Induced Aliasing in Ada

Ada Europe '01 Proceedings of the 6th Ade-Europe International Conference Leuven on Reliable Software Technologies
High Level Compiling for Low Level Machines

PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Compiler Transformation of Pointers to Explicit Array Accesses in DSP Applications

CC '01 Proceedings of the 10th International Conference on Compiler Construction
A unified framework for nonlinear dependence testing and symbolic analysis

Proceedings of the 18th annual international conference on Supercomputing
EDO: Exception-directed optimization in java

ACM Transactions on Programming Languages and Systems (TOPLAS)
Reaching fast code faster: using modeling for efficient software thread integration on a VLIW DSP

CASES '06 Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems
Enlarging Instruction Streams

IEEE Transactions on Computers
Selective search of inlining vectors for program optimization

Proceedings of the 9th conference on Computing Frontiers
On the determination of inlining vectors for program optimization

CC'13 Proceedings of the 22nd international conference on Compiler Construction
Fast condensation of the program dependence graph

Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation
Software thread integration for instruction-level parallelism

ACM Transactions on Embedded Computing Systems (TECS)

Quantified Score

Hi-index	0.01

Visualization

Abstract

Practical implementations of real languages are often an excellent way of testing the applicability of theoretical principles. Many stresses and strains arise from fitting practicalities, such as performance and standard compatibility, to theoretical models and methods. These stresses and strains are valuable sources of new research and insight, as well as an oft-needed check on the egos of theoreticians.Two fertile areas that are often explored by implementations arePlaces where tractable models fail to match practice. This can lead to new models, and may also affect practice (e.g., the average programming language has become more context free over the last several decades).Places where existing algorithms fail to deal with practical problems effectively, frequently because the problems are large in some dimension that has not been much explored.The present paper discusses the application of a much studied body of algorithms and techniques [Alle 83, KKLW 80, Bane 76, Wolf 78, Wolf 82, Kenn 80, Lamp 74, Huso 82] for vectorizing and optimizing Fortran to the problem of vectorizing and optimizing C. In the course of this work some algorithms were discarded, others invented, and many were tuned and modified. The experience gave us insight into the strengths and weaknesses of the current theory, as well as into the strong and weak points of C on vector/parallel machines. This paper attempts to communicate some of those insights.