Dynamic Configuration Management in Reliable Distributed Real-Time Information Systems

Authors:
K. H. (Kane) Kim;Chittur Subbaraman
Affiliations:
-;-
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
1999

Citing 14
Cited 2

The annotated C++ reference manual

The annotated C++ reference manual
DROL: an object-oriented programming language for distributed real-time systems

OOPSLA '92 conference proceedings on Object-oriented programming systems, languages, and applications
Fault-tolerant multicasting on hypercubes

Journal of Parallel and Distributed Computing
Advances in real-time systems

Advances in real-time systems
Action-level fault tolerance

Advances in real-time systems
An Object-Oriented Real-Time Programming Language

Computer
Object Structures for Real-Time Systems and Simulators

Computer
RMP: Fault-Tolerant Group Communication

IEEE Micro
PSRR: a scheme for time-bounded fault tolerance in distributed object-based systems

HASE '96 Proceedings of the 1996 High-Assurance Systems Engineering Workshop
A timeliness-guaranteed kernel model-DREAM kernel-and implementation techniques

RTCSA '95 Proceedings of the 2nd International Workshop on Real-Time Computing Systems and Applications
Distinguishing features and potential roles of the RTO.k object model

WORDS '94 Proceedings of the 1st Workshop on Object-Oriented Real-Time Dependable Systems
Toward Dependable Safety-Critical Software

WORDS '96 Proceedings of the 2nd Workshop on Object-Oriented Real-Time Dependable Systems (WORDS '96)
An Object-Oriented Fault-Tolerance Framework based on Specialization Techniques

WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
The Totem System

FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing

The PSTR/SNS Scheme for Real-Time Fault Tolerance via Active Object Replication and Network Surveillance

IEEE Transactions on Knowledge and Data Engineering
Toward Integration Of Major Design Techniques For Real-Time Fault-Tolerant Computer Systems

Journal of Integrated Design & Process Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

Large-scale information systems emerging in challenging application fields must meet the high standards of reliability, maintainability, and service interruption bound requirements. Their operations are entirely, or partially, of the distributed real-time data object manipulation type. A new architecture for such systems is presented in this paper. The original aspects of the architecture are mainly in two parts: 1) the time-triggered message-triggered object (TMO) structuring of the middleware and the application software of distributed real-time information systems; and 2) the dynamic configuration management subsystem (DCMS), based on the supervisor-based network surveillance (SNS) scheme. The positive impacts of this TMO structuring on maintainability and service interruption bounds are first discussed, with distributed replicated information service systems and other systems as examples. Then, the main discussion dwells on the DCMS architecture驴in particular, formal presentation of its key component: the SNS scheme. As a component of DCMS, the network surveillance (NS) subsystem enables fast learning by each interested fault-free node in the system of the faults or repair completion events occurring in other parts of the system. Currently, concrete real-time NS schemes effective in distributed systems based on point-to-point network architectures are scarce. The SNS scheme presented in this paper is a semicentralized real-time NS scheme effective in a variety of point-to-point networks. This scheme is highly scalable. An efficient implementation model for the SNS scheme is presented that can be easily adapted to various commercial operating system kernels. This paper also presents a formal analysis of the SNS scheme, on the basis of the implementation model, to obtain its strongly competitive tight bounds on the fault detection latency. Finally, some DCMS implementation issues are discussed that remain to be addressed in future research.