Diversity and fault avoidance for dependable replication systems

  • Authors:
  • Sung-Hwa Lim;Byoung-Hoon Lee;Jai-Hoon Kim

  • Affiliations:
  • Graduate School of Information and Communication, Ajou University, Suwon 443-749, South Korea;Graduate School of Information and Communication, Ajou University, Suwon 443-749, South Korea;Graduate School of Information and Communication, Ajou University, Suwon 443-749, South Korea

  • Venue:
  • Information Processing Letters
  • Year:
  • 2008

Quantified Score

Hi-index 0.89

Visualization

Abstract

In the hot-standby replication system, the system cannot process its tasks anymore when all replicated nodes have failed. Thus, the remaining living nodes should be well-protected against failure when parts of replicated nodes have failed. Design faults and system-specific weaknesses may cause chain reactions of common faults on identical replicated nodes in replication systems. These can be alleviated by replicating diverse hardware and software. Going one-step forward, failures on the remaining nodes can be suppressed by predicting and preventing the same fault when it has occurred on a replicated node. In this paper, we propose a fault avoidance scheme which increases system dependability by avoiding common faults on remaining nodes when parts of nodes fail, and analyze the system dependability.