Code scheduling and register allocation in large basic blocks

Authors:
J. R. Goodman;W.-C. Hsu
Affiliations:
Univ. of Wisconsin, Madison, WI;Cray Research Inc., Chippewa Falls, WI
Venue:
ICS '88 Proceedings of the 2nd international conference on Supercomputing
Year:
1988

Citing 12
Cited 76

Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
Advanced compiler optimizations for supercomputers

Communications of the ACM - Special issue on parallelism
Guest Editor's Introduction GaAs Microprocessor Technology

Computer - Special issue: GaAs: a technology for environmental extremes
Efficient instruction scheduling for a pipelined architecture

SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
A retargetable instruction reorganizer

SIGPLAN '86 Proceedings of the 1986 SIGPLAN symposium on Compiler construction
A study of scalar compilation techniques for pipelined supercomputers

ASPLOS II Proceedings of the second international conference on Architectual support for programming languages and operating systems
The Generation of Optimal Code for Arithmetic Expressions

Journal of the ACM (JACM)
Code Generation for Expressions with Common Subexpressions

Journal of the ACM (JACM)
Postpass Code Optimization of Pipeline Constraints

ACM Transactions on Programming Languages and Systems (TOPLAS)
An overview of the PL.8 compiler

SIGPLAN '82 Proceedings of the 1982 SIGPLAN symposium on Compiler construction
Bulldog: a compiler for vliw architectures (parallel computing, reduced-instruction-set, trace scheduling, scientific)

Bulldog: a compiler for vliw architectures (parallel computing, reduced-instruction-set, trace scheduling, scientific)
Design of a Computer—The Control Data 6600

Design of a Computer—The Control Data 6600

Achieving high instruction cache performance with an optimizing compiler

ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Instruction scheduling beyond basic blocks

IBM Journal of Research and Development
Integrating register allocation and instruction scheduling for RISCs

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
The Marion system for retargetable instruction scheduling

PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
IMPACT: an architectural framework for multiple-instruction-issue processors

ISCA '91 Proceedings of the 18th annual international symposium on Computer architecture
The effect on RISC performance of register set size and structure versus code generation strategy

ISCA '91 Proceedings of the 18th annual international symposium on Computer architecture
Efficient DAG construction and heuristic calculation for instruction scheduling

MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
Efficient superscalar performance through boosting

ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Code scheduling for VLIW/superscalar processors with limited register files

MICRO 25 Proceedings of the 25th annual international symposium on Microarchitecture
Techniques for integrating parallelizing transformations and compiler-based scheduling methods

Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Register allocation with instruction scheduling

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Lifetime-sensitive modulo scheduling

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Scheduling time-critical instructions on RISC machines

ACM Transactions on Programming Languages and Systems (TOPLAS)
Sentinel scheduling: a model for compiler-controlled speculative execution

ACM Transactions on Computer Systems (TOCS)
A schedular-sensitive global register allocator

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Zero-cost range splitting

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
GURRR: a global unified resource requirements representation

IR '95 Papers from the 1995 ACM SIGPLAN workshop on Intermediate representations
Compiler-Based Multiple Instruction Retry

IEEE Transactions on Computers
Efficient instruction scheduling for delayed-load architectures

ACM Transactions on Programming Languages and Systems (TOPLAS)
Register allocation sensitive region scheduling

PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
Allocating registers in multiple instruction-issuing processors

PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
Spill-free parallel scheduling of basic blocks

Proceedings of the 28th annual international symposium on Microarchitecture
An experimental study of several cooperative register allocation and instruction scheduling strategies

Proceedings of the 28th annual international symposium on Microarchitecture
Register allocation for predicated code

Proceedings of the 28th annual international symposium on Microarchitecture
A reduced multipipeline machine description that preserves scheduling constraints

PLDI '96 Proceedings of the ACM SIGPLAN 1996 conference on Programming language design and implementation
A register allocation technique using guarded PDG

ICS '96 Proceedings of the 10th international conference on Supercomputing
Experiences with Cooperating Register Allocation and Instruction Scheduling

International Journal of Parallel Programming
IMPACT: an architectural framework for multiple-instruction-issue processors

25 years of the international symposia on Computer architecture (selected papers)
Reorganizing global schedules for register allocation

ICS '99 Proceedings of the 13th international conference on Supercomputing
Code generation of nested loops for DSP processors with heterogeneous registers and structural pipelining

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Register-sensitive selection, duplication, and sequencing of instructions

ICS '01 Proceedings of the 15th international conference on Supercomputing
A brief survey of papers on scheduling for pipelined processors

ACM SIGPLAN Notices
A Dynamic Programming Approach to Optimal Integrated Code Generation

OM '01 Proceedings of the 2001 ACM SIGPLAN workshop on Optimization of middleware and distributed systems
Global array reference allocation

ACM Transactions on Design Automation of Electronic Systems (TODAES)
An efficient technique for exploring register file size in ASIP synthesis

CASES '02 Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems
Handling Global Constraints in Compiler Strategy

International Journal of Parallel Programming
Computation in the Context of Transport Triggered Architectures

International Journal of Parallel Programming
The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors

IEEE Transactions on Computers
Three Architectural Models for Compiler-Controlled Speculative Execution

IEEE Transactions on Computers
Efficient Exploitation of Instruction-Level Parallelism for Superscalar Processors by the Conjugate Register File Scheme

IEEE Transactions on Computers
Minimum Register Instruction Sequencing to Reduce Register Spills in Out-of-Order Issue Superscalar Architectures

IEEE Transactions on Computers
A Register Allocation Technique Using Register Existence Graph

ICPP '97 Proceedings of the international Conference on Parallel Processing
Minimum Register Instruction Sequence Problem: Revisiting Optimal Code Generation for DAGs

IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Copy Elimination for Parallelizing Compilers

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Integrated Instruction Scheduling and Register Allocation Techniques

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Minimum Register Instruction Scheduling: A New Approach for Dynamic Instruction Issue Processors

LCPC '99 Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing
Understanding and Improving Register Assignment

Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Array Reference Allocation Using SSA-Form and Live Range Growth

LCTES '00 Proceedings of the ACM SIGPLAN Workshop on Languages, Compilers, and Tools for Embedded Systems
Resource Spackling: A Framework for Integrating Register Allocation in Local and Global Schedulers

PACT '94 Proceedings of the IFIP WG10.3 Working Conference on Parallel Architectures and Compilation Techniques
URSA: A Unified ReSource Allocator for Registers and Functional Units in VLIW Architectures

PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Register Saturation in Superscalar and VLIW Codes

CC '01 Proceedings of the 10th International Conference on Compiler Construction
An Efficient Technique of Instruction Scheduling on a Superscalar-Based Mulprocessor

IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Integrated prepass scheduling for a Java Just-In-Time compiler on the IA-64 architecture

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Compiler optimization-space exploration

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Evolution of a java just-in-time compiler for IA-32 platforms

IBM Journal of Research and Development
Register saturation in instruction level parallelism

International Journal of Parallel Programming
A framework for reducing instruction scheduling overhead in dynamic compilers

CASCON '06 Proceedings of the 2006 conference of the Center for Advanced Studies on Collaborative research
An experimental comparison of cache-oblivious and cache-conscious programs

Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Tetris: a new register pressure control technique for VLIW processors

Proceedings of the 2007 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems
Fast, frequency-based, integrated register allocation and instruction scheduling

Software—Practice & Experience
Quasistatic shared libraries and XIP for memory footprint reduction in MMU-less embedded systems

ACM Transactions on Embedded Computing Systems (TECS)
Tetris-XL: A performance-driven spill reduction technique for embedded VLIW processors

ACM Transactions on Architecture and Code Optimization (TACO)
Improving both the performance benefits and speed of optimization phase sequence searches

Proceedings of the ACM SIGPLAN/SIGBED 2010 conference on Languages, compilers, and tools for embedded systems
Eliminating false phase interactions to reduce optimization phase order search space

CASES '10 Proceedings of the 2010 international conference on Compilers, architectures and synthesis for embedded systems
Register pressure aware scheduling for high level synthesis

Proceedings of the 16th Asia and South Pacific Design Automation Conference
Max-coloring and online coloring with bandwidths on interval graphs

ACM Transactions on Algorithms (TALG)
Cooperative instruction scheduling with linear scan register allocation

HiPC'05 Proceedings of the 12th international conference on High Performance Computing
Integrated instruction scheduling and fine-grain register allocation for embedded processors

SAMOS'06 Proceedings of the 6th international conference on Embedded Computer Systems: architectures, Modeling, and Simulation
Generating optimal contiguous evaluations for expression DAGs

Computer Languages
Scheduling expression DAGs for minimal register need

Computer Languages
A modular memory optimization for synchronous data-flow languages: application to arrays in a lustre compiler

Proceedings of the 13th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, Tools and Theory for Embedded Systems
Energy efficient special instruction support in an embedded processor with compact isa

Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systems
Optimal and heuristic global code motion for minimal spilling

CC'13 Proceedings of the 22nd international conference on Compiler Construction
An energy-efficient method of supporting flexible special instructions in an embedded processor with compact ISA

ACM Transactions on Architecture and Code Optimization (TACO)
Preallocation instruction scheduling with register pressure minimization using a combinatorial optimization approach

ACM Transactions on Architecture and Code Optimization (TACO)
Exploiting phase inter-dependencies for faster iterative compiler optimization phase order searches

Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

We discuss the issues about the interdependency between code scheduling and register allocation. We present two methods as solutions: (1) an integrated code scheduling technique; and (2) a DAG-driven register allocator. The integrated code scheduling method combines two scheduling techniques—one to reduce pipeline delays and the other to minimize register usage—into a single phase. By keeping track of the number of available registers, the scheduler can choose the appropriate scheduling technique to schedule a better code sequence. The DAG-driven register allocator uses a dependency graph to assist in assigning registers; it introduces much less extra dependency than does an ordinary register allocator. For large basic blocks, both approaches were shown to generate more efficient code sequences than conventional techniques in the simulations.