The effect of LUT and cluster size on deep-submicron FPGA performance and density

Authors:
Elias Ahmed;Jonathan Rose
Affiliations:
Altera Toronto Technology Center, Toronto, ON, Canada and Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada;Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
Venue:
IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on the 2002 international symposium on low-power electronics and design (ISLPED)
Year:
2004

Citing 14
Cited 45

Field-programmable gate arrays

Field-programmable gate arrays
A new high density and very low cost reprogrammable FPGA architecture

FPGA '99 Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays
An innovative, segmented high performance FPGA family with variable-grain-architecture and wide-gating functions

FPGA '99 Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays
Using cluster-based logic blocks and timing-driven packing to improve FPGA speed and density

FPGA '99 Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays
The effect of LUT and cluster size on deep-submicron FPGA performance and density

FPGA '00 Proceedings of the 2000 ACM/SIGDA eighth international symposium on Field programmable gate arrays
Using sparse crossbars within LUT

FPGA '01 Proceedings of the 2001 ACM/SIGDA ninth international symposium on Field programmable gate arrays
Efficient circuit clustering for area and power reduction in FPGAs

FPGA '02 Proceedings of the 2002 ACM/SIGDA tenth international symposium on Field-programmable gate arrays
On the sensitivity of FPGA architectural conclusions to experimental assumptions, tools, and techniques

FPGA '02 Proceedings of the 2002 ACM/SIGDA tenth international symposium on Field-programmable gate arrays
Architecture and CAD for Deep-Submicron FPGAs

Architecture and CAD for Deep-Submicron FPGAs
The stratixπ routing and logic architecture

FPGA '03 Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays
Hardware-assisted simulated annealing with application for fast FPGA placement

FPGA '03 Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays
Architecture evaluation for power-efficient FPGAs

FPGA '03 Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays
FPGA and CPLD Architectures: A Tutorial

IEEE Design & Test
How Much Logic Should Go in an FPGA Logic Block?

IEEE Design & Test

Using bus-based connections to improve field-programmable gate array density for implementing datapath circuits

Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays
Analysis of the Effect of LUT Size on FPGA Area and Delay Using Theoretical Derivations

ISQED '05 Proceedings of the 6th International Symposium on Quality of Electronic Design
A reconfigurable architecture for hybrid CMOS/Nanodevice circuits

Proceedings of the 2006 ACM/SIGDA 14th international symposium on Field programmable gate arrays
A function generator-based reconfigurable system

Proceedings of the 2005 Asia and South Pacific Design Automation Conference
Optimal simultaneous mapping and clustering for FPGA delay optimization

Proceedings of the 43rd annual Design Automation Conference
A new hybrid FPGA with nanoscale clusters and CMOS routing

Proceedings of the 43rd annual Design Automation Conference
Designing efficient input interconnect blocks for LUT clusters using counting and entropy

Proceedings of the 2007 ACM/SIGDA 15th international symposium on Field programmable gate arrays
An asynchronous fpga logic cell implementation

Proceedings of the 17th ACM Great Lakes symposium on VLSI
Evaluating area and performance of hybrid FPGAs with nanoscale clusters and CMOS routing

ACM Journal on Emerging Technologies in Computing Systems (JETC)
Performance and power evaluation of a 3D CMOS/nanomaterial reconfigurable architecture

Proceedings of the 2007 IEEE/ACM international conference on Computer-aided design
Designing Efficient Input Interconnect Blocks for LUT Clusters Using Counting and Entropy

ACM Transactions on Reconfigurable Technology and Systems (TRETS) - Special edition on the 15th international symposium on FPGAs
Area and delay trade-offs in the circuit and architecture design of FPGAs

Proceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arrays
Automated transistor sizing for FPGA architecture exploration

Proceedings of the 45th annual Design Automation Conference
Design space exploration for field programmable compressor trees

CASES '08 Proceedings of the 2008 international conference on Compilers, architectures and synthesis for embedded systems
FPGA Architecture: Survey and Challenges

Foundations and Trends in Electronic Design Automation
VPR 5.0: FPGA cad and architecture exploration tools with single-driver routing, heterogeneity and process scaling

Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
FPCNA: a field programmable carbon nanotube array

Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
Wirelength modeling for homogeneous and heterogeneous FPGA architectural development

Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
Dynamic reconfiguration architectures for multi-context FPGAs

Computers and Electrical Engineering
Design and evaluation of a carbon nanotube-based programmable architecture

International Journal of Parallel Programming
Floating-point FPGA: architecture and modeling

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
FPGA interconnect topologies exploration

International Journal of Reconfigurable Computing - Selected papers from ReCoSoc08
A novel routing architecture for field-programmable gate-arrays

ARCS'08 Proceedings of the 21st international conference on Architecture of computing systems
Using the minimum set of input combinations to minimize the area of local routing networks in logic clusters containing logically equivalent I/Os in FPGAs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
FPGA architecture optimisation using geometric programming

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
AVGS-Mux style: a novel technology and device independent technique for reducing power and compensating process variations in FPGA fabrics

Proceedings of the Conference on Design, Automation and Test in Europe
Reconfigurable circuit design with nanomaterials

Proceedings of the Conference on Design, Automation and Test in Europe
Power estimation of embedded multiplier blocks in FPGAs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Exploring area and delay tradeoffs in FPGAs with architecture and automated transistor design

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Towards scalable FPGA CAD through architecture

Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays
Reducing the pressure on routing resources of FPGAs with generic logic chains

Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays
Using bus-based connections to improve field-programmable gate-array density for implementing datapath circuits

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Effect of serialized routing resources on the implementation area of datapath circuits on FPGAS

WSEAS Transactions on Computers
mrFPGA: A novel FPGA architecture with memristor-based reconfiguration

NANOARCH '11 Proceedings of the 2011 IEEE/ACM International Symposium on Nanoscale Architectures
VPR 5.0: FPGA CAD and architecture exploration tools with single-driver routing, heterogeneity and process scaling

ACM Transactions on Reconfigurable Technology and Systems (TRETS)
Architecture and performance evaluation of 3D CMOS-NEM FPGA

Proceedings of the System Level Interconnect Prediction Workshop
Reducing the cost of floating-point mantissa alignment and normalization in FPGAs

Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
FPGA-RR: an enhanced FPGA architecture with RRAM-based reconfigurable interconnects (abstract only)

Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
Reconfigurable Blocks Based on Balanced Ternary

Journal of Signal Processing Systems
Utilizing multi-bit connections to improve the area efficiency of unidirectional routing resources for routing multi-bit signals on FPGAs

Microprocessors & Microsystems
Interlaced switch boxes placement for three-dimensional FPGA architecture design

International Journal of Circuit Theory and Applications
The secure wavelet transform

Journal of Real-Time Image Processing
Hierarchical Benchmark Circuit Generation for FPGA Architecture Evaluation

ACM Transactions on Embedded Computing Systems (TECS) - Special Section on CAPA'09, Special Section on WHS'09, and Special Section VCPSS' 09
Analysis of the area-delay performance of hybrid nanoelectronic memory cores used in field programmable gate arrays

Proceedings of the 23rd ACM international conference on Great lakes symposium on VLSI
Towards development of an analytical model relating FPGA architecture parameters to routability

ACM Transactions on Reconfigurable Technology and Systems (TRETS) - Special Section on 19th Reconfigurable Architectures Workshop (RAW 2012)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we revisit the field-programmable gatearray (FPGA) architectural issue of the effect of logic block functionality on FPGA performance and density. In particular, in the context of lookup table, cluster-based island-style FPGAs (Betz et al. 1997) we look at the effect of lookup table (LUT) size and cluster size (number of LUTs per cluster) on the speed and logic density of an FPGA.We use a fully timing-driven experimental flow (Betz et al. 1997), (Marquardt, 1999) in which a set of benchmark circuits are synthesized into different cluster-based (Betz and Rose, 1997, 1998) and (Marquardt, 1999) logic block architectures, which contain groups of LUTs and flip-flops. Across all architectures with LUT sizes in the range of 2 to 7 inputs, and cluster size from 1 to 10 LUTs, we have experimentally determined the relationship between the number of inputs required for a cluster as a function of the LUT size (K) and cluster size (N). Second, contrary to previous results, we have shown that clustering small LUTs (sizes 2 and 3) produces better area results than what was presented in the past. However, our results also show that the performance of FPGAs with these small LUT sizes is significantly worse (by almost a factor of 2) than larger LUTs. Hence, as measured by area-delay product, or by performance, these would be a bad choice. Also, we have discovered that LUT sizes of 5 and 6 produce much better area results than were previously believed. Finally, our results show that a LUT size of 4 to 6 and cluster size of between 3-10 provides the best area-delay product for an FPGA.