Reliable communication in the presence of failures
ACM Transactions on Computer Systems (TOCS)
Exploiting virtual synchrony in distributed systems
SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
Preserving and using context information in interprocess communication
ACM Transactions on Computer Systems (TOCS)
Lightweight causal and atomic group multicast
ACM Transactions on Computer Systems (TOCS)
Distributed process groups in the V Kernel
ACM Transactions on Computer Systems (TOCS)
TOTEM: a reliable ordered delivery protocol for interconnected local-area networks
TOTEM: a reliable ordered delivery protocol for interconnected local-area networks
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
Broadcast Protocols for Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Processor Membership in Asynchronous Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Design and Performance of Horus: A Lightweight Group Communications System
Design and Performance of Horus: A Lightweight Group Communications System
An Adaptive Algorithm for Tolerating Value Faults and Crash Failures
IEEE Transactions on Parallel and Distributed Systems
Dynamic Configuration Management in Reliable Distributed Real-Time Information Systems
IEEE Transactions on Knowledge and Data Engineering
AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects
IEEE Transactions on Computers
Failure Detection vs Group Membership in Fault-Tolerant Distributed Systems: Hidden Trade-Offs
PAPM-PROBMIV '02 Proceedings of the Second Joint International Workshop on Process Algebra and Probabilistic Methods, Performance Modeling and Verification
Performance Analysis of Java Group Toolkits: A Case Study
FIDJI '01 Revised Papers from the International Workshop on Scientific Engineering for Distributed Java Applications
Topology-Aware Algorithms for Large-Scale Communication
Advances in Distributed Systems, Advanced Distributed Computing: From Algorithms to Systems
A multiple bus broadcast protocol resilient to non-cooperative Byzantine faults
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Three-tier replication for FT-CORBA infrastructures
Software—Practice & Experience
The object group design pattern
COOTS'96 Proceedings of the 2nd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 2
Transparent autonomization in CORBA
Computer Networks: The International Journal of Computer and Telecommunications Networking
Semi-passive replication and Lazy Consensus
Journal of Parallel and Distributed Computing
ZooKeeper: wait-free coordination for internet-scale systems
USENIXATC'10 Proceedings of the 2010 USENIX conference on USENIX annual technical conference
Research: Design and analysis of an efficient and reliable atomic multicast protocol
Computer Communications
Avoiding disruptive failovers in transaction processing systems with multiple active nodes
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
Abstract: The Totem system supports fault-tolerant applications in which distributed processes cooperate to perform a common task and in which replicated data must be updated consistently in the presence of asynchrony and faults. Reliable totally ordered delivery of messages to processes within process groups is provided on a single local-area network or over multiple local-area networks interconnected by gateways. Message ordering is consistent across the entire network, despite processor and communication faults, without requiring all processes to deliver all messages. The Totem system handles processor failure and recovery, as well as network partitioning and remerging, and provides membership and topology maintenance services.