Using agent wills to provide fault-tolerance in distributed shared memory systems

Authors:
Antony Rowstron
Affiliations:
Microsoft Research Ltd., Cambridge, UK
Venue:
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
Year:
2000

Citing 13
Cited 0

Generative communication in Linda

ACM Transactions on Programming Languages and Systems (TOPLAS)
Linda in context

Communications of the ACM
KLAIM: A Kernel Language for Agents Interaction and Mobility

IEEE Transactions on Software Engineering
Coordinating Multiagent Applications on the WWW: A Reference Architecture

IEEE Transactions on Software Engineering
Coordinating Java agents over the WWW

World Wide Web
WCL: A co-ordination language for geographically distributed agents

World Wide Web
Coordination for Internet Application Development

Autonomous Agents and Multi-Agent Systems
MARS: A Programmable Coordination Architecture for Mobile Agents

IEEE Internet Computing
Persistant Linda: Linda + Transactions + Query Processing

Research Directions in High-Level Parallel Programming Languages
Presence and Instant Messaging via HTTP /1.1: A Coordination Perspective

COORDINATION '99 Proceedings of the Third International Conference on Coordination Languages and Models
Mobile Co-ordination: Providing Fault Tolerance in Tuple Space Based Co-ordination Languages

COORDINATION '99 Proceedings of the Third International Conference on Coordination Languages and Models
Bonita: A set of tuple space primitives for distributed coordination

HICSS '97 Proceedings of the 30th Hawaii International Conference on System Sciences: Software Technology and Architecture - Volume 1
T spaces

IBM Systems Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we describe how we use mobile objects to provide distributed programs coordinating through a persistent distributed shared memory (DSM) with tolerance to sudden agent failure, and use the increasingly popular Linda-like tuple space languages as an example for implementation of the concept. In programs coordinating and communicating through a DSM a data structure is shared between multiple agents, and the agents update the shared structure directly. However, if an agent should suddenly fail it is often hard for the agents to make the data structures consistent with the new application state. For example consider if a data structure contains a list of active agents. In such a case, transactions can be used when adding and removing agent names from the list ensuring that that the data structure is consistent and does not become corrupted should an agent fail. However, if failure of the agent occurs after the name has been added, how does the application ensure the list is correct? We argue that using mobile objects we can provide wills for the agents to effectively enable them to ensure the shared data structure is application consistent, even once they have failed. We show how we have integrated the use of agent wills into a Linda system and show that we have not increased the complexity of program writing. The integration is simple and general, does not alter the underlying semantics of the operations performed in the will and the use of mobility is transparent to the programmer.