DeSyRe: On-demand system reliability

  • Authors:
  • I. Sourdis;C. Strydis;A. Armato;C. S. Bouganis;B. Falsafi;G. N. Gaydadjiev;S. Isaza;A. Malek;R. Mariani;D. Pnevmatikatos;D. K. Pradhan;G. Rauwerda;R. M. Seepers;R. A. Shafik;K. Sunesen;D. Theodoropoulos;S. Tzilis;M. Vavouras

  • Affiliations:
  • Computer Science and Engineering Dept., Chalmers University of Technology, Sweden;Neurasmus B.V., The Netherlands;Yogitech SpA, Italy;Electrical and Electronic Engineering Dept., Imperial College London, UK;Computer and Communication Sciences, EPFL, Switzerland;Computer Science and Engineering Dept., Chalmers University of Technology, Sweden;Neurasmus B.V., The Netherlands;Computer Science and Engineering Dept., Chalmers University of Technology, Sweden;Yogitech SpA, Italy;Institute of Computer Science (ICS), Foundation for Research and Technology - Hellas (FORTH), Greece and Electronic and Computer Engineering Department, Technical University of Crete, Greece;Computer Science Dept., University of Bristol, UK;Recore Systems B.V., The Netherlands;Neurasmus B.V., The Netherlands;Computer Science Dept., University of Bristol, UK;Recore Systems B.V., The Netherlands;Institute of Computer Science (ICS), Foundation for Research and Technology - Hellas (FORTH), Greece;Computer Science and Engineering Dept., Chalmers University of Technology, Sweden;Electrical and Electronic Engineering Dept., Imperial College London, UK

  • Venue:
  • Microprocessors & Microsystems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The DeSyRe project builds on-demand adaptive and reliable Systems-on-Chips (SoCs). As fabrication technology scales down, chips are becoming less reliable, thereby incurring increased power and performance costs for fault tolerance. To make matters worse, power density is becoming a significant limiting factor in SoC design, in general. In the face of such changes in the technological landscape, current solutions for fault tolerance are expected to introduce excessive overheads in future systems. Moreover, attempting to design and manufacture a totally defect-/fault-free system, would impact heavily, even prohibitively, the design, manufacturing, and testing costs, as well as the system performance and power consumption. In this context, DeSyRe delivers a new generation of systems that are reliable by design at well-balanced power, performance, and design costs. In our attempt to reduce the overheads of fault-tolerance, only a small fraction of the chip is built to be fault-free. This fault-free part is then employed to manage the remaining fault-prone resources of the SoC. The DeSyRe framework is applied to two medical systems with high safety requirements (measured using the IEC 61508 functional safety standard) and tight power and performance constraints.