Preserving and using context information in interprocess communication
ACM Transactions on Computer Systems (TOCS)
Unreliable failure detectors for asynchronous systems (preliminary version)
PODC '91 Proceedings of the tenth annual ACM symposium on Principles of distributed computing
The ISIS project: real experience with a fault tolerant programming system
ACM SIGOPS Operating Systems Review
Lightweight causal and atomic group multicast
ACM Transactions on Computer Systems (TOCS)
The process group approach to reliable distributed computing
Communications of the ACM
Impossibility of distributed consensus with one faulty process
Journal of the ACM (JACM)
Distributed process groups in the V Kernel
ACM Transactions on Computer Systems (TOCS)
RELACS: A Communications Infrastructure for Constructing Reliable Applications in Large-Scale Distributed Systems
Hi-index | 0.00 |
An increasing number of applications with reliability requirements are being deployed in distributed systems that span large geographic distances or manage large numbers of objects. We consider the process group mechanism as an appropriate application structuring paradigm in such large-scale distributed systems. We give a formal characterization for the attribute "large scale" as applied to distributed systems and examine the technical problems that need to be solved in making group technology scalable. Our design advocates multiple roles for group membership over a minimal set of abstractions and primitives. The design is currently being implemented on top of "off-the-shelf" technologies for both communication and computation.