Fault-tolerant computing based on Mach
ACM SIGOPS Operating Systems Review
The ISIS project: real experience with a fault tolerant programming system
ACM SIGOPS Operating Systems Review
POSIX.4: programming for the real world
POSIX.4: programming for the real world
ACM Transactions on Computer Systems (TOCS)
Delta Four: A Generic Architecture for Dependable Distributed Computing
Delta Four: A Generic Architecture for Dependable Distributed Computing
Design and Implementation of a Pluggable Fault Tolerant CORBA Infrastructure
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Towards middleware for fault-tolerance in distributed real-time and embedded systems
DAIS'08 Proceedings of the 8th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Hi-index | 0.00 |
The paper reports a practical implementation of a strategy to support semi-active replication of real-time software components (i.e. sets of tasks) running on the Chorus/ClassiX distributed operating system. The main property of the replication strategy developed in this paper is to solve the major difficulty of replica determinism. The semi-active replication scheme consists of a leader software component and identical follower replicas. Only the leader component sends out application messages as well as notifications indicating the order which messages have been consumed and produced. Dynamic non-deterministic scheduling of tasks within the different replicas may cause the follower tasks to lag in their execution regarding the leader ones.