DDMR: Dynamic and Scalable Dual Modular Redundancy with Short Validation Intervals

  • Authors:
  • Amit Golander;Shlomo Weiss;Ronny Ronen

  • Affiliations:
  • Tel Aviv University, Tel Aviv;Tel Aviv University, Tel Aviv;Intel Haifa, Haifa

  • Venue:
  • IEEE Computer Architecture Letters
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

DMR (Dual Modular Redundancy) was suggested for increasing reliability. Classical DMR consists of pairs of cores that check each other and are pre-connected during manufacturing by dedicated links. In this paper we introduce the Dynamic Dual Modular Redundancy (DDMR) architecture. DDMR supports run-time scheduling of redundant threads, which has significant benefits relative to static binding. To allow dynamic pairing, DDMR replaces the special links with a novel ring architecture. DDMR uses short instruction sequences for validation, smaller than the processor reorder buffer. Such short sequences reduce latencies in parallel programs and save resources needed to buffer uncommitted data. DDMR scales with the number of cores and may be used in large multicore architectures.