Optimistic recovery in distributed systems
ACM Transactions on Computer Systems (TOCS)
ACM Transactions on Programming Languages and Systems (TOPLAS)
Distributed discrete-event simulation
ACM Computing Surveys (CSUR)
Performance evaluation of the time warp distributed simulation mechanism
Performance evaluation of the time warp distributed simulation mechanism
An execution model for distributed object-oriented computation
OOPSLA '88 Conference proceedings on Object-oriented programming systems, languages and applications
Recovery in distributed systems using asynchronous message logging and checkpointing
PODC '88 Proceedings of the seventh annual ACM Symposium on Principles of distributed computing
Parallel execution of sequential scheme with ParaTran
LFP '88 Proceedings of the 1988 ACM conference on LISP and functional programming
Efficient distributed recovery using message logging
Proceedings of the eighth annual ACM Symposium on Principles of distributed computing
Parallel discrete event simulation
Communications of the ACM - Special issue on simulation
On optimistic methods for concurrency control
ACM Transactions on Database Systems (TODS)
A Majority consensus approach to concurrency control for multiple copy databases
ACM Transactions on Database Systems (TODS)
Fail-stop processors: an approach to designing fault-tolerant computing systems
ACM Transactions on Computer Systems (TOCS)
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
Weighted voting for replicated data
SOSP '79 Proceedings of the seventh ACM symposium on Operating systems principles
The failure and recovery problem for replicated databases
PODC '83 Proceedings of the second annual ACM symposium on Principles of distributed computing
Replicated objects in time warp simulations
WSC '92 Proceedings of the 24th conference on Winter simulation
A replication structure for efficient and fault-tolerant parallel and distributed simulations
SpringSim '10 Proceedings of the 2010 Spring Simulation Multiconference
Federate Fault Tolerance in HLA-Based Simulation
PADS '10 Proceedings of the 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Hi-index | 14.98 |
A recovery protocol for distributed systems using the time warp control mechanism is described. The proposed protocol is fault tolerant to multiple process failures. Time warp is an optimistic execution technique in which synchronization is achieved using rollback. The recovery protocol exploits the redundancy already available to implement process rollback in the time warp mechanism. Thus, the protocol has little additional bookkeeping overhead, which contrasts with many other recovery protocols.