Register-sensitive selection, duplication, and sequencing of instructions

Authors:
Vivek Sarkar;Mauricio J. Serrano;Barbara B. Simons
Affiliations:
IBM Research, T. J. Watson Research Center;Intel Microprocessor, Research Labs;Stanford University
Venue:
ICS '01 Proceedings of the 15th international conference on Supercomputing
Year:
2001

Citing 25
Cited 3

Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
The program dependence graph and its use in optimization

ACM Transactions on Programming Languages and Systems (TOPLAS)
Automatic translation of FORTRAN programs to vector form

ACM Transactions on Programming Languages and Systems (TOPLAS)
Code scheduling and register allocation in large basic blocks

ICS '88 Proceedings of the 2nd international conference on Supercomputing
Optimal code generation for expression trees: an application BURS theory

POPL '88 Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Experiences using control dependence in PTRAN

Selected papers of the second workshop on Languages and compilers for parallel computing
Integrating register allocation and instruction scheduling for RISCs

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
A compiler-assisted approach to SPMD execution

Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Register allocation with instruction scheduling

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Scalar replacement in the presence of conditional control flow

Software—Practice & Experience
Instruction scheduling for the HP PA-8000

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
Optimal code selection in DAGs

Proceedings of the 26th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
The Jalapeño dynamic optimizing compiler for Java

JAVA '99 Proceedings of the ACM 1999 conference on Java Grande
Efficient and precise modeling of exceptions for the analysis of Java programs

Proceedings of the 1999 ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Adaptive optimization in the Jalapeño JVM

OOPSLA '00 Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Optimizing Supercompilers for Supercomputers

Optimizing Supercompilers for Supercomputers
Java Virtual Machine Specification

Java Virtual Machine Specification
A Retargetable C Compiler: Design and Implementation

A Retargetable C Compiler: Design and Implementation
Baring It All to Software: Raw Machines

Computer
URSA: A Unified ReSource Allocator for Registers and Functional Units in VLIW Architectures

PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Unified Analysis of Array and Object References in Strongly Typed Languages

SAS '00 Proceedings of the 7th International Symposium on Static Analysis
An overview of the PL.8 compiler

SIGPLAN '82 Proceedings of the 1982 SIGPLAN symposium on Compiler construction
A Register Pressure Sensitive Instruction Scheduler for Dynamic Issue Processors

PACT '97 Proceedings of the 1997 International Conference on Parallel Architectures and Compilation Techniques
Evaluating Register Allocation and Instruction Scheduling Techniques in Out-Of-Order Issue Processors

PACT '99 Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques
The Jalapeño virtual machine

IBM Systems Journal

Experiences Porting the Jikes RVM to Linux/IA32

Proceedings of the 2nd Java Virtual Machine Research and Technology Symposium
Factor: a dynamic stack-based programming language

Proceedings of the 6th symposium on Dynamic languages
Efficient Selection of Vector Instructions Using Dynamic Programming

MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a new framework for selecting, duplicating and sequencing instructions so as to decrease register pressure. The motivation for this work is to target current and future high-performance processors where reductions in register pressure in the compiled programs can lead to improved performance.For instruction selection and duplication, a unique feature of our approach is the ability to perform these transformations on intermediate-language instructions in a general dependence graph that contains both true and non-true dependences, unlike past work that restricted their attention to a single expression tree or a single expression dag. For instruction sequencing, we present a new algorithm for reducing register pressure that is based on backwards schedulingWe present preliminary performance results to validate our approach. Our results show that register-sensitive instruction duplication can deliver significant speedups (up to 1.22x) for the SPECint95 benchmarks on an IA-32 processor. We also show that register-sensitive sequencing delivers smaller speedups (up to 1.12x) for the SPECjvm and Java Grande benchmarks on a PowerPC processor (when utilizing two-thirds of its registers). We expect to see more significant speedups due to register-sensitive sequencing on processors with fewer register than the PowerPC (such as the IA-32).