A compilation technique for software pipelining of loops with conditional jumps

Authors:
Kemal Ebcioğlu
Affiliations:
IBM, Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY
Venue:
MICRO 20 Proceedings of the 20th annual workshop on Microprogramming
Year:
1987

Citing 15
Cited 54

Bulldog: a compiler for VLSI architectures

Bulldog: a compiler for VLSI architectures
A computer with low-level parallelism QA-2: its applications to 3-D graphics and Prolog/Lisp machines

ISCA '86 Proceedings of the 13th annual international symposium on Computer architecture
URPR—An extension of URCR for software pipelining

MICRO 19 Proceedings of the 19th annual workshop on Microprogramming
The GF11 supercomputer

ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
A technique for reducing synchronization overhead in large scale multiprocessors

ISCA '85 Proceedings of the 12th annual international symposium on Computer architecture
Pipeline Architecture

ACM Computing Surveys (CSUR)
A Fortran compiler for the FPS-164 scientific computer

SIGPLAN '84 Proceedings of the 1984 SIGPLAN symposium on Compiler construction
Mathematical Theory of Program Correctness

Mathematical Theory of Program Correctness
A critique of multiprocessing von Neumann style

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
Efficient code generation for horizontal architectures: Compiler techniques and architectural support

ISCA '82 Proceedings of the 9th annual symposium on Computer Architecture
A critique of multiprocessing von Neumann style

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture
Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing

MICRO 14 Proceedings of the 14th annual workshop on Microprogramming
The microprogramming of pipelined processors

ISCA '77 Proceedings of the 4th annual symposium on Computer architecture
Percolation Scheduling: A Parallel Compilation Technique

Percolation Scheduling: A Parallel Compilation Technique
The optimization of horizontal microcode within and beyond basic blocks: an application of processor scheduling with resources

The optimization of horizontal microcode within and beyond basic blocks: an application of processor scheduling with resources

The white dwarf: a high-performance application-specific processor

ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
Optimal loop parallelization

PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Software pipelining: an effective scheduling technique for VLIW machines

PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Requirements for optimal execution of oops with tests

ICS '88 Proceedings of the 2nd international conference on Supercomputing
“Combining” as a compilation technique for VLIW architectures

MICRO 22 Proceedings of the 22nd annual workshop on Microprogramming and microarchitecture
Instruction scheduling for the IBM RISC System/6000 processor

IBM Journal of Research and Development
A variable instruction stream extension to the VLIW architecture

ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
A timed Petri-net model for fine-grain loop scheduling

PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
Software pipelining: an evaluation of enhanced pipelining

MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
A new technique for induction variable removal

MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
GURPR*: a new global software pipelining algorithm

MICRO 24 Proceedings of the 24th annual international symposium on Microarchitecture
MOVE: a framework for high-performance processor design

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Speedup of band linear recurrences in the presence of resource constraints

ICS '92 Proceedings of the 6th international conference on Supercomputing
Sentinel scheduling for VLIW and superscalar processors

ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
A non-deterministic scheduler for a software pipelining compiler

MICRO 25 Proceedings of the 25th annual international symposium on Microarchitecture
A dynamic-programming technique for compacting loops

MICRO 25 Proceedings of the 25th annual international symposium on Microarchitecture
Extraction of massive instruction level parallelism

ACM SIGARCH Computer Architecture News
A novel framework of register allocation for software pipelining

POPL '93 Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Sentinel scheduling: a model for compiler-controlled speculative execution

ACM Transactions on Computer Systems (TOCS)
VLIW compilation techniques in a superscalar environment

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Iterative modulo scheduling: an algorithm for software pipelining loops

MICRO 27 Proceedings of the 27th annual international symposium on Microarchitecture
Instruction scheduling in the TOBEY compiler

IBM Journal of Research and Development
Generating compilers for generated datapaths

EURO-DAC '94 Proceedings of the conference on European design automation
Software pipelining

ACM Computing Surveys (CSUR)
Resource-Constrained Software Pipelining

IEEE Transactions on Parallel and Distributed Systems
Region-based compilation: an introduction and motivation

Proceedings of the 28th annual international symposium on Microarchitecture
Anticipatory instruction scheduling

Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Software pipelining loops with conditional branches

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
A Framework for Resource-Constrained Rate-Optimal Software Pipelining

IEEE Transactions on Parallel and Distributed Systems
GPMB—software pipelining branch-intensive loops

MICRO 26 Proceedings of the 26th annual international symposium on Microarchitecture
Software pipelining: a comparison and improvement

MICRO 23 Proceedings of the 23rd annual workshop and symposium on Microprogramming and microarchitecture
Using a lookahead window in a compaction-based parallelizing compiler

MICRO 23 Proceedings of the 23rd annual workshop and symposium on Microprogramming and microarchitecture
Realistic scheduling: compaction for pipelined architectures

MICRO 23 Proceedings of the 23rd annual workshop and symposium on Microprogramming and microarchitecture
Split-path enhanced pipeline scheduling for loops with control flows

MICRO 31 Proceedings of the 31st annual ACM/IEEE international symposium on Microarchitecture
A global resource-constrained parallelization technique

ICS '89 Proceedings of the 3rd international conference on Supercomputing
A comparative study of modulo scheduling techniques

ICS '02 Proceedings of the 16th international conference on Supercomputing
Enhancing loop buffering of media and telecommunications applications using low-overhead predication

Proceedings of the 34th annual ACM/IEEE international symposium on Microarchitecture
Control Flow Regeneration for Software Pipelined Loops with Conditions

International Journal of Parallel Programming
Three Architectural Models for Compiler-Controlled Speculative Execution

IEEE Transactions on Computers
Requirements for Optimal Execution of Loops with Tests

IEEE Transactions on Parallel and Distributed Systems
Making Compaction-Based Parallelization Affordable

IEEE Transactions on Parallel and Distributed Systems
A finite state machine based format model of software pipelined loops with conditions

Progress in computer research
Copy Elimination for Parallelizing Compilers

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Decomposed Software Pipelining: A New Approach to Exploit Instruction Level Parallelism for Loop Programs

PACT '93 Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
Software Pipelining of Nested Loops

CC '01 Proceedings of the 10th International Conference on Compiler Construction
A First Step Towards Time Optimal Software Pipelining of Loops with Control Flows

CC '01 Proceedings of the 10th International Conference on Compiler Construction
Predicated Software Pipelining Technique for Loops with Conditions

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
A timed Petri-net model for fine-grain loop scheduling

CASCON '91 Proceedings of the 1991 conference of the Centre for Advanced Studies on Collaborative research
Register allocation for optimal loop scheduling

CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
Probabilistic Predicate-Aware Modulo Scheduling

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Software pipelining: an effective scheduling technique for VLIW machines

ACM SIGPLAN Notices - Best of PLDI 1979-1999
Compiler-Directed ILP Extraction for Clustered VLIW/EPIC Machines: Predication, Speculation and Modulo Scheduling

DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Using a lookahead window in a compaction-based parallelizing compiler

ACM SIGMICRO Newsletter
How many threads to spawn during program multithreading?

LCPC'10 Proceedings of the 23rd international conference on Languages and compilers for parallel computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe a compilation algorithm for efficient software pipelining of general inner loops, where the number of iterations and the time taken by each iteration may be unpredictable, due to arbitrary if-then- else statements and conditional exit statements within the loop. As our target machine, we assume a wide instruction word architecture that allows multi-way branching in the form of if-then-else trees, and that allows conditional register transfers depending on where the microinstruction branches to (a hardware implementation proposal for such a machine is briefly described in the paper). Our compilation algorithm, which we call the pipeline scheduling technique, produces a software- pipelined version of a given inner loop, which allows a new iteration of the loop to begin on every cycle whenever dependencies and resources permit. The correctness and termination properties of the algorithm are studied in the paper.