This paper proposes a way of tolerating software (design) faults in distributed systems, based on the well-known conversation (atomic action) approach. To this end, we consider the differences between two programming paradigms, group communication and conversations, and discuss how a group communication service can be used to provide design fault tolerance through conversations. The main characteristics and peculiarities of the resulting conversational group service are described.