Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
Comparing primary-backup and state machines for crash failures
PODC '96 Proceedings of the fifteenth annual ACM symposium on Principles of distributed computing
Distributed systems (2nd Ed.)
Reaching Agreement in the Presence of Faults
Journal of the ACM (JACM)
Byzantine generals in action: implementing fail-stop processors
ACM Transactions on Computer Systems (TOCS)
An n log n algorithm for minimizing states in a finite automaton
An n log n algorithm for minimizing states in a finite automaton
Replication for web hosting systems
ACM Computing Surveys (CSUR)
Replication for web hosting systems
ACM Computing Surveys (CSUR)
Replication algorithms for the World-Wide Web
Journal of Systems Architecture: the EUROMICRO Journal
Distributed Computing
Fusible Data Structures for Fault-Tolerance
ICDCS '07 Proceedings of the 27th International Conference on Distributed Computing Systems
Hi-index | 0.00 |
Given a set of n different deterministic finite state machines (DFSMs), we examine the problem of tolerating k faults among them. The traditional approach to this problem involves replication, requiring n.k backup DFSMs. For example, given two state machines, say A and B, to tolerate two faults, this approach maintains two copies each of A and B, thus resulting in a total of six DFSMs in the system. In this paper, we question the optimality of such an approach and present another approach based on the 'fusion' of state machines allowing for more eÆcient backups. We introduce the theory of fusion machines and provide an algorithm which can generate fusion machines corresponding to a given set of machines. Further, we have implemented this algorithm and tested it for various examples. It is important to note that our approach requires only k backup DFSMs, as opposed to the n.k backup DFSMs required by the replication approach.