The annotated C++ reference manual
The annotated C++ reference manual
DROL: an object-oriented programming language for distributed real-time systems
OOPSLA '92 conference proceedings on Object-oriented programming systems, languages, and applications
Fault-tolerant multicasting on hypercubes
Journal of Parallel and Distributed Computing
Advances in real-time systems
Advances in real-time systems
RMP: Fault-Tolerant Group Communication
IEEE Micro
PSRR: a scheme for time-bounded fault tolerance in distributed object-based systems
HASE '96 Proceedings of the 1996 High-Assurance Systems Engineering Workshop
A timeliness-guaranteed kernel model-DREAM kernel-and implementation techniques
RTCSA '95 Proceedings of the 2nd International Workshop on Real-Time Computing Systems and Applications
Distinguishing features and potential roles of the RTO.k object model
WORDS '94 Proceedings of the 1st Workshop on Object-Oriented Real-Time Dependable Systems
Toward Dependable Safety-Critical Software
WORDS '96 Proceedings of the 2nd Workshop on Object-Oriented Real-Time Dependable Systems (WORDS '96)
An Object-Oriented Fault-Tolerance Framework based on Specialization Techniques
WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
IEEE Transactions on Knowledge and Data Engineering
Toward Integration Of Major Design Techniques For Real-Time Fault-Tolerant Computer Systems
Journal of Integrated Design & Process Science
Hi-index | 0.00 |
Large-scale information systems emerging in challenging application fields must meet the high standards of reliability, maintainability, and service interruption bound requirements. Their operations are entirely, or partially, of the distributed real-time data object manipulation type. A new architecture for such systems is presented in this paper. The original aspects of the architecture are mainly in two parts: 1) the time-triggered message-triggered object (TMO) structuring of the middleware and the application software of distributed real-time information systems; and 2) the dynamic configuration management subsystem (DCMS), based on the supervisor-based network surveillance (SNS) scheme. The positive impacts of this TMO structuring on maintainability and service interruption bounds are first discussed, with distributed replicated information service systems and other systems as examples. Then, the main discussion dwells on the DCMS architecture驴in particular, formal presentation of its key component: the SNS scheme. As a component of DCMS, the network surveillance (NS) subsystem enables fast learning by each interested fault-free node in the system of the faults or repair completion events occurring in other parts of the system. Currently, concrete real-time NS schemes effective in distributed systems based on point-to-point network architectures are scarce. The SNS scheme presented in this paper is a semicentralized real-time NS scheme effective in a variety of point-to-point networks. This scheme is highly scalable. An efficient implementation model for the SNS scheme is presented that can be easily adapted to various commercial operating system kernels. This paper also presents a formal analysis of the SNS scheme, on the basis of the implementation model, to obtain its strongly competitive tight bounds on the fault detection latency. Finally, some DCMS implementation issues are discussed that remain to be addressed in future research.