Principles of transaction-oriented database recovery
ACM Computing Surveys (CSUR)
Error recovery in asynchronous systems
IEEE Transactions on Software Engineering
Atomic actions for fault-tolerance using CSP
IEEE Transactions on Software Engineering
Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
The C++ programming language (2nd ed.)
The C++ programming language (2nd ed.)
The Java programming language (2nd ed.)
The Java programming language (2nd ed.)
Exception handling in agent systems
Proceedings of the third annual conference on Autonomous Agents
Distributed systems (2nd Ed.)
The Timed Asynchronous Distributed System Model
IEEE Transactions on Parallel and Distributed Systems
Concurrent Exception Handling and Resolution in Distributed Object Systems
IEEE Transactions on Parallel and Distributed Systems
Reliability Issues in Computing System Design
ACM Computing Surveys (CSUR)
Exception handling: issues and a proposed notation
Communications of the ACM
Rigorous Development of an Embedded Fault-Tolerant System Based on Coordinated Atomic Actions
IEEE Transactions on Computers - Special issue on fault-tolerant embedded systems
ADA as a Second Language
Fault Tolerance: Principles and Practice
Fault Tolerance: Principles and Practice
Notes on Data Base Operating Systems
Operating Systems, An Advanced Course
Rigorous Development of a Safety-Critical System Based on Coordinated Atomic Actions
FTCS '99 Proceedings of the Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing
Process structuring, synchronization, and recovery using atomic actions
Proceedings of an ACM conference on Language design for reliable software
The Guardian Model for Exception Handling in Distributed Systems
SRDS '02 Proceedings of the 21st IEEE Symposium on Reliable Distributed Systems
Open Multithreaded Transactions: Keeping Threads and Exceptions under Control
WORDS '01 Proceedings of the Sixth International Workshop on Object-Oriented Real-Time Dependable Systems (WORDS'01)
Exception handling and resolution in distributed object-oriented systems
ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
A Modular Approach to Fault-Tolerant Broadcasts and Related Problems
A Modular Approach to Fault-Tolerant Broadcasts and Related Problems
The guardian model for exception handling in distributed systems
The guardian model for exception handling in distributed systems
Challenges in exception handling in multi-agent systems
Proceedings of the 2006 international workshop on Software engineering for large-scale multi-agent systems
An architecture for exception management in multiagent systems
International Journal of Agent-Oriented Software Engineering
Challenges for Exception Handling in Multi-Agent Systems
Software Engineering for Multi-Agent Systems V
Specification of an exception handling system for a replicated agent environment
Proceedings of the 4th international workshop on Exception handling
A definition of exceptions in agent-oriented computing
ESAW'06 Proceedings of the 7th international conference on Engineering societies in the agents world VII
Context-oriented exception handling
International Journal of High Performance Systems Architecture
Exception handling and asynchronous active objects: issues and proposal
Advanced Topics in Exception Handling Techniques
Ambient-Oriented exception handling
Advanced Topics in Exception Handling Techniques
Exception handling in CSCW applications in pervasive computing environments
Advanced Topics in Exception Handling Techniques
Hi-index | 0.00 |
This paper presents an abstraction called guardian for exception handling in distributed and concurrent systems that use coordinated exception handling. This model addresses two fundamental problems with distributed exception handling in a group of asynchronous processes. The first is to perform recovery when multiple exceptions are concurrently signaled. The second is to determine the correct context in which a process should execute its exception handling actions. Several schemes have been proposed in the past to address these problems. These are based on structuring a distributed program as atomic actions based on conversations or transactions and resolving multiple concurrent exceptions into a single one. The guardian in a distributed program represents the abstraction of a global exception handler, which encapsulates rules for handling concurrent exceptions and directing each process to the semantically correct context for executing its recovery actions. Its programming primitives and the underlying distributed execution model are presented here. In contrast to the existing approaches, this model is more basic and can be used to implement or enhance the existing schemes. Using several examples we illustrate the capabilities of this model. Finally, its advantages and limitations are discussed in contrast to existing approaches.