On group communication in large-scale distributed systems
ACM SIGOPS Operating Systems Review
Programming distributed fault tolerant systems: the replicAda approach
Proceedings of the conference on TRI-Ada '97
Client-Access Protocols for Replicated Services
IEEE Transactions on Software Engineering
On group communication in large-scale distributed systems
EW 6 Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
On the Quality of Service of Failure Detectors
IEEE Transactions on Computers
Scalable Stability Detection Using Logical Hypercube
IEEE Transactions on Parallel and Distributed Systems
A hierarchical fault-tolerant ring protocol for distributed real-time systems
Parallel and distributed real-time systems
Agreement Problems in Fault-Tolerant Distributed Systems
SOFSEM '01 Proceedings of the 28th Conference on Current Trends in Theory and Practice of Informatics Piestany: Theory and Practice of Informatics
Group, majority, and strict agreement in timed asynchronous distributed systems
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Strong and weak virtual synchrony in Horus
SRDS '96 Proceedings of the 15th Symposium on Reliable Distributed Systems
Scalable Stability Detection Using Logical Hypercube
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
ITRA: Inter-Tier Relationship Architecture for End-to-end QoS
The Journal of Supercomputing
Performance study of a probabilistic multicast transport protocol
Performance Evaluation
Future directions in distributed computing
Journal of Intelligent Manufacturing
Hi-index | 0.00 |
Distributed systems that span large geographic distances or manage large numbers of objects are already common place. In such systems, programming applications with even modest reliability requirements that run correctly and efficiently is a difficult task due to the asynchrony and the possibility of complex failure scenarios. In this paper, we describe the architecture of the Relacs communication system that constitutes the "microkernel" of a layered approach to system support for reliable computing in large-scale distributed systems. Relacs is highly portable and implements a very small number of abstractions and primitives that are sufficient for building a variety of interesting higher-level paradigms.