End-to-end register data-flow continuous self-test

  • Authors:
  • Javier Carretero;Pedro Chaparro;Xavier Vera;Jaume Abella;Antonio González

  • Affiliations:
  • Intel Barcelona Research Center, Intel Labs - UPC, Barcelona, Spain;Intel Barcelona Research Center, Intel Labs - UPC, Barcelona, Spain;Intel Barcelona Research Center, Intel Labs - UPC, Barcelona, Spain;Intel Barcelona Research Center, Intel Labs - UPC, Barcelona, Spain;Intel Barcelona Research Center, Intel Labs - UPC, Barcelona, Spain

  • Venue:
  • Proceedings of the 36th annual international symposium on Computer architecture
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

While Moore's Law predicts the ability of semi-conductor industry to engineer smaller and more efficient transistors and circuits, there are serious issues not contemplated in that law. One concern is the verification effort of modern computing systems, which has grown to dominate the cost of system design. On the other hand, technology scaling leads to burn-in phase out. As a result, in-the-field error rate may increase due to both actual errors and latent defects. Whereas data can be protected with arithmetic codes (like parity or ECC), there is a lack of cost-effective mechanisms for control logic. This paper presents a light-weight microarchitectural mechanism that ensures that data consumed through registers are correct. Microarchitecture presents a new way to manage reliability and testing without significantly sacrificing cost and performance, offering a unique opportunity to detect errors in the field at low cost. Our results show a coverage around 90% for the targeted structures with a cost in power and area of about 4%. The structures protected include the issue queue logic and the data associated (i.e., tags, control signals), input multiplexors, rename data, replay logic, register free list, bypasses data and logic, MOB data and addresses, register file logic, register file storage and functional units.