Error recovery in asynchronous systems
IEEE Transactions on Software Engineering
The B-book: assigning programs to meanings
The B-book: assigning programs to meanings
Advanced Exception Handling Mechanisms
IEEE Transactions on Software Engineering
Alloy: a lightweight object modelling notation
ACM Transactions on Software Engineering and Methodology (TOSEM)
Rigorous Development of an Embedded Fault-Tolerant System Based on Coordinated Atomic Actions
IEEE Transactions on Computers - Special issue on fault-tolerant embedded systems
Transaction Processing: Concepts and Techniques
Transaction Processing: Concepts and Techniques
Fault Tolerance: Principles and Practice
Fault Tolerance: Principles and Practice
Structuring Integrated Web Applications for Fault Tolerance
ISADS '03 Proceedings of the The Sixth International Symposium on Autonomous Decentralized Systems (ISADS'03)
Fault Tolerance in Concurrent Object-Oriented Software through Coordinated Error Recovery
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Finding and preventing run-time error handling mistakes
OOPSLA '04 Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Exception handling in the development of dependable component-based systems
Software—Practice & Experience - Research Articles
A formal architectural model for exception handling coordination
Proceedings of the 2008 ACM symposium on Applied computing
Improving reliability of cooperative concurrent systems with exception flow analysis
Journal of Systems and Software
Global-to-local approach to rigorously developing distributed system with exception handling
Journal of Computer Science and Technology
Science of Computer Programming
Reasoning about exception flow at the architectural level
Rigorous Development of Complex Fault-Tolerant Systems
Hi-index | 0.01 |
An important challenge faced by the developers of fault-tolerant distributed systems is to build fault tolerance mechanisms that are reliable. To achieve the desired levels of reliability, the development of mechanisms for detecting and handling errors should be rigorous or formal. In this paper, we present an approach to modeling and verifying fault-tolerant distributed systems that use exception handling as the main fault tolerance mechanism. The proposed approach is based on a formal model for specifying the structure of a system in terms of cooperating participants that handle exceptions in a coordinated manner. We use a medical control system as a case study to validate the proposed approach.