Dynamic loop pipelining in data-driven architectures

Authors:
Joäo M. P. Cardoso
Affiliations:
University of Algarve, Lisbon, Portugal
Venue:
Proceedings of the 2nd conference on Computing frontiers
Year:
2005

Citing 26
Cited 3

The program dependence graph and its use in optimization

ACM Transactions on Programming Languages and Systems (TOPLAS)
Wavefront Array Processors-Concept to Implementation

Computer
Dataflow machine architecture

ACM Computing Surveys (CSUR)
A Data-Driven VLSI Array for Arbitrary Algorithms

Computer
Compiling for dataflow software pipelining

Selected papers of the second workshop on Languages and compilers for parallel computing
Improving register allocation for subscripted variables

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
The program dependence web: a representation supporting control-, data-, and demand-driven interpretation of imperative languages

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
From control flow to dataflow

Journal of Parallel and Distributed Computing - Special issue on shared-memory multiprocessors
Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Iterative modulo scheduling: an algorithm for software pipelining loops

MICRO 27 Proceedings of the 27th annual international symposium on Microarchitecture
Software pipelining

ACM Computing Surveys (CSUR)
Adapting software pipelining for reconfigurable computing

CASES '00 Proceedings of the 2000 international conference on Compilers, architecture, and synthesis for embedded systems
A decade of reconfigurable computing: a visionary retrospective

Proceedings of the conference on Design, automation and test in Europe
Performance-constrained pipelining of software loops onto reconfigurable hardware

FPGA '02 Proceedings of the 2002 ACM/SIGDA tenth international symposium on Field-programmable gate arrays
A Code Mapping Scheme for Dataflow Software Pipelining

A Code Mapping Scheme for Dataflow Software Pipelining
XPP-VC: A C Compiler with Temporal Partitioning for the PACT-XPP Architecture

FPL '02 Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications
Very Large Scale Spatial Computing

UMC '02 Proceedings of the Third International Conference on Unconventional Models of Computation
A C to HDL Compiler for Pipeline Processing on FPGAs

FCCM '00 Proceedings of the 2000 IEEE Symposium on Field-Programmable Custom Computing Machines
PACT XPP—A Self-Reconfigurable Data Processing Architecture

The Journal of Supercomputing
WaveScalar

Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
Advances in dataflow programming languages

ACM Computing Surveys (CSUR)
Exploiting Loop-Level Parallelism on Coarse-Grained Reconfigurable Architectures Using Modulo Scheduling

DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Spatial computation

Spatial computation
Spatial computation

ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
An Asynchronous Dataflow FPGA Architecture

IEEE Transactions on Computers
Pipeline vectorization

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

The implementation of a coarse-grained reconfigurable architecture with loop self-pipelining

ARC'07 Proceedings of the 3rd international conference on Reconfigurable computing: architectures, tools and applications
Designing a coarse-grained reconfigurable architecture using loop self-pipelining

ACSAC'06 Proceedings of the 11th Asia-Pacific conference on Advances in Computer Systems Architecture
Data-driven regular reconfigurable arrays: design space exploration and mapping

SAMOS'05 Proceedings of the 5th international conference on Embedded Computer Systems: architectures, Modeling, and Simulation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data-driven array architectures seem to be important alternatives for coarse-grained reconfigurable computing platforms. Their use has provided performance improvements over microprocessors and shorter programming cycles than FPGA-based platforms. As with other architectures, in data-driven architectures loop pipelining plays an important role to improve performance. Usually this kind of pipelining can be achieved using the dataflow software pipelining technique or other software pipelining approaches. Although performance improvements are achieved, those techniques heavily depend on the insertion of pipelining stages and thus require complex balancing efforts. Furthermore, those techniques statically define the pipelining and do not take fully advantage of the dynamic scheduling attainable by the data-driven concept. This paper presents a novel scheme to pipeline loops in data-driven architectures, orchestrated by a handshaking protocol. Using the new approach, self loop pipelining is naturally achieved. The scheme is based on duplicating cyclic hardware structures, in order they are autonomously executed, with synchronization being achieved by the data flow. It can be applied to nested loops, requires less aggressive pipeline balancing efforts than usual software pipelining techniques, and innermost loops with conditional structures can be pipelined without conservative pipelining implementationsWe show results of using the proposed technique when mapping algorithms in imperative programming languages to the PACT eXtreme Processing Platform (XPP). The results confirm improvements over the use of conventional loop pipelining techniques. Better performance and fewer resources are achieved in a number of cases.