Evaluation of scheduling and allocation algorithms while mapping assembly code onto FPGAs

Authors:
David Zaretsky;Gaurav Mittal;Xiaoyong Tang;Prith Banerjee
Affiliations:
Northwestern University, Evanston, IL;Northwestern University, Evanston, IL;Northwestern University, Evanston, IL;Northwestern University, Evanston, IL
Venue:
Proceedings of the 14th ACM Great Lakes symposium on VLSI
Year:
2004

Citing 9
Cited 2

Specifying representations of machine instructions

ACM Transactions on Programming Languages and Systems (TOPLAS)
Quality and speed in linear-scan register allocation

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Advanced compiler design and implementation

Advanced compiler design and implementation
Linear scan register allocation

ACM Transactions on Programming Languages and Systems (TOPLAS)
Synthesis and Optimization of Digital Circuits

Synthesis and Optimization of Digital Circuits
Hardware/software partitioning of software binaries

Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design
Dynamic hardware/software partitioning: a first approach

Proceedings of the 40th annual Design Automation Conference
Register allocation & spilling via graph coloring

SIGPLAN '82 Proceedings of the 1982 SIGPLAN symposium on Compiler construction
The New Jersey machine-code toolkit

TCON'95 Proceedings of the USENIX 1995 Technical Conference Proceedings

Automatic translation of software binaries onto FPGAs

Proceedings of the 41st annual Design Automation Conference
Automatic extraction of function bodies from software binaries

Proceedings of the 2005 Asia and South Pacific Design Automation Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Migration of software from older general purpose embedded processors onto newer mixed hardware/software Systems-On-Chip (SOC) platforms is becoming an increasingly important topic. Automatic translation of general purpose software binaries and assembly code onto hardware implementations using FPGAs require sophisticated scheduling and allocation algorithms to maximize the resource utilization of such hardware devices. This paper describes the effects of scheduling and chaining of node operations in a CDFG onto an FPGA. The effects of register allocation on scheduled nodes are also discussed. The Texas Instruments C6000 DSP processor architecture was chosen as the DSP processor platform and assembly code, and the Xilinx Virtex II XC2V250 was chosen as the target FPGA. Results are reported on ten benchmarks, which show that scheduling with chaining operations produces the best results on FPGAs, while the addition of register allocation in fact generates poorer designs in terms of area and frequency.