Object-Based Checkpoints in Distributed Systems

Authors:
Katsuya Tanaka;Hiroaki Higaki;Makoto Takizawa
Affiliations:
-;-;-
Venue:
WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
Year:
1997

Citing 16
Cited 0

Reliable communication in the presence of failures

ACM Transactions on Computer Systems (TOCS)
Checkpointing and Rollback-Recovery for Distributed Systems

IEEE Transactions on Software Engineering - Special issue on distributed systems
Concurrency control and recovery in database systems

Concurrency control and recovery in database systems
Groupware: some issues and experiences

Communications of the ACM
Principles and realization strategies of multilevel transaction management

ACM Transactions on Database Systems (TODS)
Distributed snapshots: determining global states of distributed systems

ACM Transactions on Computer Systems (TOCS)
Using semantic knowledge for transaction processing in a distributed database

ACM Transactions on Database Systems (TODS)
Locking Primitives in a Database System

Journal of the ACM (JACM)
Time, clocks, and the ordering of events in a distributed system

Communications of the ACM
Causally Ordering Group Communication Protocol

Proceedings of the 1994 International Conference on Parallel and Distributed Systems
Notes on Data Base Operating Systems

Operating Systems, An Advanced Course
A low-overhead recovery technique using quasi-synchronous checkpointing

ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
Group communication protocol for flexible distributed systems

ICNP '96 Proceedings of the 1996 International Conference on Network Protocols (ICNP '96)
Distributed checkpointing based on influential messages

ICPADS '96 Proceedings of the 1996 International Conference on Parallel and Distributed Systems
Communication protocol for group of distributed objects

ICPADS '96 Proceedings of the 1996 International Conference on Parallel and Distributed Systems
Global States of a Distributed System

IEEE Transactions on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

In distributed applications, multiple objects are cooperated to achieve some objectives. The objects may suffer from kinds of faults. If some object o is faulty, o is rolled back to the checkpoint and objects which have received messages from o are also required to be rolled back. In this paper, we define influential messages whose receivers are required to be rolled back if the senders are rolled back in the object-based computation model. By using the influential messages, an object-based (O) checkpoints are defined to denote semantically consistent global states of the system while inconsistent with the traditional message-based definition. We show how many checkpoints can be reduced by taking only the O-checkpoints.