Fault-tolerant scheduling algorithm for distributed real-time systems

  • Authors:
  • T. Tsuchiya;Y. Kakuda;T. Kikuno

  • Affiliations:
  • -;-;-

  • Venue:
  • WPDRTS '95 Proceedings of the 3rd Workshop on Parallel and Distributed Real-Time Systems
  • Year:
  • 1995

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a distributed real-time system, tolerance to faults on processing nodes in the system is achieved by means of redundant nodes and a fault-tolerant scheduling algorithm. Since redundant nodes increase total failure rate of the system, the number of such redundant nodes should be small. This paper proposes a procedure for fault-tolerant scheduling, which realizes fault-tolerance via a small number of redundant nodes. The procedure is based on such a technique that allows multiple copies of a task to be executed concurrently. It achieves efficient utilization of nodes by forcing copies being executed to terminate immediately after having obtained the first result of these copies. A fundamental scheduling algorithm, into which the procedure is incorporated, is defined and its simulation results are shown.