Design of Multi-Invariant Data Structures for Robust Shared Accesses in Multiprocessor Systems

Authors:
I-Ling Yen;Farokh B. Bastani;David J. Taylor
Affiliations:
Univ. of Texas at Dallas, Richardson;Univ. of Texas at Dallas, Richardson;Univ. of Waterloo, Waterloo, Ont., Canada
Venue:
IEEE Transactions on Software Engineering
Year:
2001

Citing 24
Cited 1

Implementation of resilient, atomic data types

ACM Transactions on Programming Languages and Systems (TOPLAS) - Lecture notes in computer science Vol. 174
Robust Storage Structures for Crash Recovery

IEEE Transactions on Computers - The MIT Press scientific computation series
Local Correction of Helix(k) Lists

IEEE Transactions on Computers
Tentative steps toward a development method for interfering programs

ACM Transactions on Programming Languages and Systems (TOPLAS)
Local Concurrent Error Detection and Correction in Data Structures Using Virtual Backpointers

IEEE Transactions on Computers
Understanding fault-tolerant distributed systems

Communications of the ACM
Distributed reset (extended abstract)

FST and TC 10 Proceedings of the tenth conference on Foundations of software technology and theoretical computer science
Stabilizing Communication Protocols

IEEE Transactions on Computers - Special issue on protocol engineering
Smart cars and highways go global

IEEE Spectrum
The Performance of Parity Placements in Disk Arrays

IEEE Transactions on Computers
Design of a Fault-Tolerant Three-Dimensional Dynamic Random-Access Memory with On-Chip Error-Correcting Circuit

IEEE Transactions on Computers
TTP-A Protocol for Fault-Tolerant Real-Time Systems

Computer
SuperStabilizing protocols for dynamic distributed systems

Proceedings of the fourteenth annual ACM symposium on Principles of distributed computing
Fault-tolerant real-time objects

Communications of the ACM
Component Based Design of Multitolerant Systems

IEEE Transactions on Software Engineering
Designing Masking Fault-Tolerance via Nonmasking Fault-Tolerance

IEEE Transactions on Software Engineering
Self-stabilizing systems in spite of distributed control

Communications of the ACM
Inductive methods for proving properties of programs

Communications of the ACM
Nonblocking commit protocols

SIGMOD '81 Proceedings of the 1981 ACM SIGMOD international conference on Management of data
Processor and Memory-Based Checkpoint and Rollback Recovery

Computer
A Fault Tolerant Replicated Storage System

Proceedings of the Third International Conference on Data Engineering
Notes on Data Base Operating Systems

Operating Systems, An Advanced Course
Atomic Transactions

Distributed Systems - Architecture and Implementation, An Advanced Course
Mathematical Theory of Computation

Mathematical Theory of Computation

Adaptive correctness monitoring for wireless sensor networks using hierarchical distributed run-time invariant checking

ACM Transactions on Autonomous and Adaptive Systems (TAAS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Multiprocessor systems are widely used in many application programs to enhance system reliability and performance. However, reliability does not come naturally with multiple processors. We develop a multi-invariant data structure approach to ensure efficient and robust access to shared data structures in multiprocessor systems. Essentially, the data structure is designed to satisfy two invariants, a strong invariant, and a weak invariant. The system operates at its peak performance when the strong invariant is true. The system will operate correctly even when only the weak invariant is true, though perhaps at a lower performance level. The design ensures that the weak invariant will always be true in spite of fail-stop processor failures during the execution. By allowing the system to converge to a state satisfying only the weak invariant, the overhead for incorporating fault tolerance can be reduced. In this paper, we present the basic idea of multi-invariant data structures. We also develop design rules that systematically convert fault-intolerant data abstractions into corresponding fault-tolerant versions. In this transformation, we augment the data structure and access algorithms to ensure that the system always converges to the weak invariant even in the presence of fail-stop processor failures. We also design methods for the detection of integrity violations and for restoring the strong invariant. Two data structures, namely, binary search tree and double-linked list, are used to illustrate the concept of multi-invariant data structures.