Nested transactions: an approach to reliable distributed computing
Nested transactions: an approach to reliable distributed computing
Optimistic recovery in distributed systems
ACM Transactions on Computer Systems (TOCS)
Checkpointing and Rollback-Recovery for Distributed Systems
IEEE Transactions on Software Engineering - Special issue on distributed systems
Communications of the ACM
ACM Transactions on Computer Systems (TOCS)
Distributed snapshots: determining global states of distributed systems
ACM Transactions on Computer Systems (TOCS)
Replication and fault-tolerance in the ISIS system
Proceedings of the tenth ACM symposium on Operating systems principles
Distributed transactions for reliable systems
Proceedings of the tenth ACM symposium on Operating systems principles
The Recovery Manager of the System R Database Manager
ACM Computing Surveys (CSUR)
SOSP '81 Proceedings of the eighth ACM symposium on Operating systems principles
A message system supporting fault tolerance
SOSP '83 Proceedings of the ninth ACM symposium on Operating systems principles
ARGUS REFERENCE MANUAL
Availability in the Sprite distributed file system
ACM SIGOPS Operating Systems Review
Availability in the Sprite distributed file system
EW 4 Proceedings of the 4th workshop on ACM SIGOPS European workshop
Implementing a Semi-Active Replication Strategy in CHORUS/Classix, a Distributed Real-Time Executive
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
Hi-index | 0.00 |
We consider the problem of providing automatic and transparent fault tolerance to arbitrary user computations based on the Mach operating system. Among the several alternatives for structuring such a system, we pursue the "task-pair backup" paradigm in detail and outline how it might be supported by Mach. Some of the new system calls and protocols that need to be incorporated into the Mach kernel and server tasks are sketched.