Fault tolerant mechanism to improve yield in NoCs using a reconfigurable router

  • Authors:
  • Caroline Concatto;Debora Matos;Luigi Carro;Fernanda Kastensmidt;Altamiro Susin;Erika Cota;Marcio Kreutz

  • Affiliations:
  • UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRGS - Federal University of Rio Grande do Sul, Porto Alegre, Brazil;UFRN - University of Rio Grande do Norte, Natal, Brazil

  • Venue:
  • Proceedings of the 22nd Annual Symposium on Integrated Circuits and System Design: Chip on the Dunes
  • Year:
  • 2009

Abstract

As design complexity increases and technologies scale down, devices become subject to new types of malfunctions and failures. Network-on-chip (NoC) routers are responsible for ensuring proper communication among on-chip cores, and the buffers in the router channels are crucial to communication performance. However, faults can affect the routers' services, compromising communication integrity and the operation of the whole system. This work proposes the simultaneous use of reconfiguration, Hamming code, and Triple Modular Redundancy (TMR) to ensure fault tolerance in the FIFOs and links of NoCs. The proposed router can dynamically stop using faulty buffers and, to sustain performance, borrow buffer units from its neighbor channels whenever necessary. The Hamming code protects the data in the links against a fault in a wire, while TMR protects the control logic of the FIFO. The new router increases reliability by 63% and shows low latency and power consumption compared with the original router. The hardware overhead is 77% more gates than when the reconfigurable router is used only to increase NoC performance; this additional logic serves to improve yield and system lifetime.
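
The two protection schemes named in the abstract can be illustrated with a minimal software sketch. The abstract does not state the code parameters or the voter design, so the Hamming(7,4) layout, the bitwise majority voter, and the function names below are assumptions chosen for illustration, not the authors' hardware implementation.

```python
# Illustrative sketch only: Hamming(7,4) and a bitwise majority voter are
# assumed here; the paper's actual link width and voter design are not given.

def hamming74_encode(data):
    """Encode 4 data bits into a 7-bit codeword (bit positions 1..7)."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4  # parity over positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # parity over positions 2, 3, 6, 7
    p4 = d2 ^ d3 ^ d4  # parity over positions 4, 5, 6, 7
    return [p1, p2, d1, p4, d2, d3, d4]


def hamming74_decode(codeword):
    """Correct at most one flipped bit (one faulty wire) and return the data bits."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s4 = c[3] ^ c[4] ^ c[5] ^ c[6]
    syndrome = s1 + 2 * s2 + 4 * s4  # 1-based position of the faulty bit, 0 if none
    if syndrome:
        c[syndrome - 1] ^= 1
    return [c[2], c[4], c[5], c[6]]


def tmr_vote(a, b, c):
    """Bitwise majority of three redundant copies of a control value."""
    return (a & b) | (a & c) | (b & c)


if __name__ == "__main__":
    word = [1, 0, 1, 1]
    sent = hamming74_encode(word)
    received = list(sent)
    received[5] ^= 1  # model a single faulty wire on the link
    print(hamming74_decode(received) == word)
    print(tmr_vote(0b1010, 0b1010, 0b0010))  # one corrupted copy is outvoted
```

In this sketch a single-bit fault on the link is located by the syndrome and corrected, while the voter masks a fault in any one of the three control copies; neither handles multiple simultaneous faults within the same word or the same bit of two copies.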