Hardware-assisted simulated annealing with application for fast FPGA placement

Authors:
Michael G. Wrighton;André M. DeHon
Affiliations:
California Institute of Technology, Pasadena, CA;California Institute of Technology, Pasadena, CA
Venue:
FPGA '03 Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays
Year:
2003

Citing 12
Cited 14

VLSI cell placement techniques

ACM Computing Surveys (CSUR)
Parallel algorithms for VLSI computer-aided design

Parallel algorithms for VLSI computer-aided design
Trading quality for compile time: ultra-fast placement for FPGAs

FPGA '99 Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays
Parallel algorithms for FPGA placement

GLSVLSI '00 Proceedings of the 10th Great Lakes symposium on VLSI
Clock rate versus IPC: the end of the road for conventional microarchitectures

Proceedings of the 27th annual international symposium on Computer architecture
Runtime and quality tradeoffs in FPGA placement and routing

FPGA '01 Proceedings of the 2001 ACM/SIGDA ninth international symposium on Field programmable gate arrays
Architecture and CAD for Deep-Submicron FPGAs

Architecture and CAD for Deep-Submicron FPGAs
Stochastic, spatial routing for hypergraphs, trees, and meshes

FPGA '03 Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays
VPR: A new packing, placement and routing tool for FPGA research

FPL '97 Proceedings of the 7th International Workshop on Field-Programmable Logic and Applications
A novel net weighting algorithm for timing-driven placement

Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design
Congestion minimization during placement without estimation

Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design
Hardware-Assisted Fast Routing

FCCM '02 Proceedings of the 10th Annual IEEE Symposium on Field-Programmable Custom Computing Machines

Fast timing-driven partitioning-based placement for island style FPGAs

Proceedings of the 40th annual Design Automation Conference
BEE2: A High-End Reconfigurable Computing System

IEEE Design & Test
Defect tolerance at the end of the roadmap

Nano, quantum and molecular computing
New timing and routability driven placement algorithms for FPGA synthesis

Proceedings of the 17th ACM Great Lakes symposium on VLSI
An overview of reconfigurable hardware in embedded systems

EURASIP Journal on Embedded Systems
High-quality, deterministic parallel placement for FPGAs on commodity hardware

Proceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arrays
Criticality history guided FPGA placement algorithm for timing optimization

Proceedings of the 18th ACM Great Lakes symposium on VLSI
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation

Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation
Improving FPGA placement with dynamically adaptive stochastic tunneling

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Multilevel optimization for large-scale hierarchical FPGA placement

Journal of Computer Science and Technology
Scalable and deterministic timing-driven parallel placement for FPGAs

Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays
The effect of LUT and cluster size on deep-submicron FPGA performance and density

IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on the 2002 international symposium on low-power electronics and design (ISLPED)
Efficient and Deterministic Parallel Placement for FPGAs

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Spatial hardware implementation for sparse graph algorithms in GraphStep

ACM Transactions on Autonomous and Adaptive Systems (TAAS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

To truly exploit FPGAs for rapid turn-around development and prototyping, placement times must be reduced to seconds; late-bound, reconfigurable computing applications may demand placement times as short as microseconds. In this paper, we show how a systolic structure can accelerate placement by assigning one processing element to each possible location for an FPGA LUT from a design netlist. We demonstrate that our technique approaches the same quality point as traditional simulated annealing as measured by a simple linear wirelength metric. Experimental results look ahead to compare quality against VPR's fast placer when considering the minimum channel width required to route as the primary optimization criteria. Preliminary results from an FPGA implementation show the feasibility of accelerating simulated annealing by three orders of magnitude using this approach. This means we can place the largest design in the University of Toronto's "FPGA Placement and Routing Challenge" in around 4ms.