Methodologies for Tolerating Cell and Interconnect Faults in FPGAs

Authors:
Fran Hanchek; Shantanu Dutt
Affiliations:
Intel Corp., Hillsboro, OR; Univ. of Illinois at Chicago, Chicago
Venue:
IEEE Transactions on Computers
Year:
1998

Quantified Score

Hi-index	14.98

Abstract

The very high levels of integration and submicron device sizes used in current and emerging VLSI technologies for FPGAs lead to higher occurrences of defects and operational faults. Thus, there is a critical need for fault tolerance and reconfiguration techniques for FPGAs to increase chip yields (with factory reconfiguration) and/or system reliability (with field reconfiguration). We first propose techniques utilizing the principle of node-covering to tolerate logic or cell faults in SRAM-based FPGAs. A routing discipline is developed that allows each cell to cover (that is, to be able to replace) its neighbor in a row. Techniques are also proposed for tolerating wiring faults by means of replacement with spare portions. The replaceable portions can be individual segments, or else sets of segments, called “grids”. Fault detection in the FPGAs is accomplished by separate testing, either at the factory or by the user. If reconfiguration around faulty cells and wiring is performed at the factory (with laser-burned fuses, for example), it is completely transparent to the user. In other words, user configuration data loaded into the SRAM remains the same, independent of whether the chip is defect-free or whether it has been reconfigured around defective cells or wiring. This is a major advantage for hardware vendors who design and sell FPGA-based logic (e.g., glue logic in microcontrollers, video cards, DSP cards) in production-scale quantities. Compared to other techniques for fault tolerance in FPGAs, our methods are shown to provide significantly greater yield improvement, and a 35 percent non-FT chip yield for a 16×16 FPGA is more than doubled.
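
The node-covering idea in the abstract, where each cell can replace its neighbor in a row, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes one spare cell at the right end of each row and a single faulty cell per row; the function name `reconfigure_row` and the index-list representation are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Set


def reconfigure_row(num_logical: int, faulty: Set[int]) -> Optional[List[int]]:
    """Map logical cells 0..num_logical-1 onto a row of num_logical + 1
    physical cells, where the last physical cell is an assumed spare.

    Returns mapping[i] = physical index hosting logical cell i, or None
    when the single spare cannot absorb the faults.
    """
    num_physical = num_logical + 1
    if any(not 0 <= f < num_physical for f in faulty):
        raise ValueError("fault index outside the row")
    if len(faulty) > 1:
        return None  # assumption: one spare per row covers at most one fault
    if not faulty:
        return list(range(num_logical))  # spare stays unused
    bad = next(iter(faulty))
    # Cells left of the fault keep their place; every logical cell from the
    # fault onward moves to its right-hand neighbor (its "cover"), and the
    # chain of replacements ends at the spare.
    return [i if i < bad else i + 1 for i in range(num_logical)]


if __name__ == "__main__":
    print(reconfigure_row(4, set()))   # [0, 1, 2, 3]
    print(reconfigure_row(4, {1}))     # [0, 2, 3, 4]
    print(reconfigure_row(4, {1, 3}))  # None
```

The logical cell indices never change; only the physical placement shifts. This mirrors the abstract's claim that the user's configuration data can stay the same after factory reconfiguration.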