This paper describes a group communication system, called the timewheel group communication system, that has been designed for a timed asynchronous distributed system model. All protocols in the timewheel group communication system have been designed to be fail-aware in the sense that a process can detect, at any point in time, whether any of its properties is violated. Although these protocols have been designed to operate in an asynchronous distributed computing environment, they provide timeliness properties. The timewheel group communication system provides nine group communication semantics from which a user can choose dynamically when broadcasting an update. The system provides high throughput and fast delivery and stability times, uses a small number of messages per update broadcast, and distributes the processing load evenly among group members.