Aggressive inlining

Authors:
Andrew Ayers;Richard Schooler;Robert Gottlieb
Affiliations:
Hewlett-Packard Massachusetts Language Laboratory, 300 Apollo Drive, Chelmsford, MA;Hewlett-Packard Massachusetts Language Laboratory, 300 Apollo Drive, Chelmsford, MA;Hewlett-Packard Massachusetts Language Laboratory, 300 Apollo Drive, Chelmsford, MA
Venue:
Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
Year:
1997

Citing 11
Cited 34

A study of a C function inliner

Software—Practice & Experience
Compiling C for vectorization, parallelization, and inline expansion

PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Inline function expansion for compiling C programs

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Interprocedural analysis vs. procedure integration

Information Processing Letters
Profile guided code positioning

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Profile-guided automatic inline expansion for C programs

Software—Practice & Experience
An analysis of inline substitution for a structured programming language

Communications of the ACM
Advanced performance features of the 64-bit PA-8000

COMPCON '95 Proceedings of the 40th IEEE Computer Society International Conference
Compiler Optimizations for the PA-8000

COMPCON '97 Proceedings of the 42nd IEEE International Computer Conference
Predicting the effects of optimization on a procedure body

SIGPLAN '79 Proceedings of the 1979 SIGPLAN symposium on Compiler construction
Improving UNIX kernel performance using profile based optimization

WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference

Interprocedural conditional branch elimination

Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation
Complete removal of redundant expressions

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Scalable cross-module optimization

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Architectural and compiler support for energy reduction in the memory hierarchy of high performance microprocessors

ISLPED '98 Proceedings of the 1998 international symposium on Low power electronics and design
An automatic object inlining optimization and its evaluation

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
A comparative study of static and profile-based heuristics for inlining

DYNAMO '00 Proceedings of the ACM SIGPLAN workshop on Dynamic and adaptive compilation and optimization
A study of exception handling and its dynamic optimization in Java

OOPSLA '01 Proceedings of the 16th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
In or out?: putting write barriers in their place

Proceedings of the 3rd international symposium on Memory management
Online feedback-directed optimization of Java

OOPSLA '02 Proceedings of the 17th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
An Empirical Study of Method In-lining for a Java Just-in-Time Compiler

Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
Adaptive online context-sensitive inlining

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
On the side-effects of code abstraction

Proceedings of the 2003 ACM SIGPLAN conference on Language, compiler, and tool for embedded systems
Function inlining versus function cloning

ACM SIGPLAN Notices
Function inlining versus function cloning

ACM SIGPLAN Notices
Should potential loop optimizations influence inlining decisions?

CASCON '03 Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research
SYZYGY - A Framework for Scalable Cross-Module IPO

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Complete removal of redundant expressions

ACM SIGPLAN Notices - Best of PLDI 1979-1999
Scalable High Performance Cross-Module Inlining

Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques
Design and evaluation of dynamic optimizations for a Java just-in-time compiler

ACM Transactions on Programming Languages and Systems (TOPLAS)
EDO: Exception-directed optimization in java

ACM Transactions on Programming Languages and Systems (TOPLAS)
Eliminating stack overflow by abstract interpretation

ACM Transactions on Embedded Computing Systems (TECS)
Inline Analysis: Beyond Selection Heuristics

Proceedings of the International Symposium on Code Generation and Optimization
A framework for unrestricted whole-program optimization

Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
Optimizing subroutines with optional parameters in F90 via function cloning

ACM SIGPLAN Notices
Complete inlining of recursive calls: beyond tail-recursion elimination

Proceedings of the 44th annual Southeast regional conference
Improving Compilation of Java Scientific Applications

International Journal of High Performance Computing Applications
Adaptive inlining and on-stack replacement in the CACAO virtual machine

Proceedings of the 5th international symposium on Principles and practice of programming in Java
Enlarging Instruction Streams

IEEE Transactions on Computers
Function Inlining in Embedded Systems with Code Size Limitation

ICESS '07 Proceedings of the 3rd international conference on Embedded Software and Systems
Jolt: lightweight dynamic analysis and removal of object churn

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
Optimal interprocedural program optimization: a new framework and its application

Optimal interprocedural program optimization: a new framework and its application
Lightweight feedback-directed cross-module optimization

Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization
Aggressive function inlining: preventing loop blockings in the instruction cache

HiPEAC'08 Proceedings of the 3rd international conference on High performance embedded architectures and compilers
On the determination of inlining vectors for program optimization

CC'13 Proceedings of the 22nd international conference on Compiler Construction

Quantified Score

Hi-index	0.00

Visualization

Abstract

Existing research understates the benefits that can be obtained from inlining and cloning, especially when guided by profile information. Our implementation of inlining and cloning yields excellent results on average and very rarely lowers performance. We believe our good results can be explained by a number of factors: inlining at the intermediate-code level removes most technical restrictions on what can be inlined; the ability to inline across files and incorporate profile information enables us to choose better inline candidates; a high-quality back end can exploit the scheduling and register allocation opportunities presented by larger subroutines; an aggressive processor architecture benefits from more predictable branch behavior; and a large instruction cache mitigates the impact of code expansion. We describe the often dramatic impact of our inlining and cloning on performance: for example, the implementations of our inlining and cloning algorithms in the HP-UX 10.20 compilers boost SPECint95 performance on a PA8000-based workstation by a factor of 1.32.