Clock synchronization in distributed real-time systems
IEEE Transactions on Computers - Special Issue on Real-Time Systems
ACM Computing Surveys (CSUR)
Self-stabilizing systems in spite of distributed control
Communications of the ACM
IEEE Transactions on Computers
Formal Verification of the TTP Group Membership Algorithm
FORTE/PSTV 2000 Proceedings of the FIP TC6 WG6.1 Joint International Conference on Formal Description Techniques for Distributed Systems and Communication Protocols (FORTE XIII) and Protocol Specification, Testing and Verification (PSTV XX)
Bus Architectures for Safety-Critical Embedded Systems
EMSOFT '01 Proceedings of the First International Workshop on Embedded Software
An Overview of Formal Verification for the Time-Triggered Architecture
FTRTFT '02 Proceedings of the 7th International Symposium on Formal Techniques in Real-Time and Fault-Tolerant Systems: Co-sponsored by IFIP WG 2.2
Parametric Verification of a Group Membership Algorithm
FTRTFT '02 Proceedings of the 7th International Symposium on Formal Techniques in Real-Time and Fault-Tolerant Systems: Co-sponsored by IFIP WG 2.2
FTCS '98 Proceedings of the The Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing
Avoiding the Babbling-Idiot Failure in a Time-Triggered Communication System
FTCS '98 Proceedings of the The Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing
An Investigation of Membership and Clique Avoidance in TTP/C
SRDS '00 Proceedings of the 19th IEEE Symposium on Reliable Distributed Systems
Detectors and Correctors: A Theory of Fault-Tolerance Components
ICDCS '98 Proceedings of the The 18th International Conference on Distributed Computing Systems
The Central Guardian Approach to Enforce Fault Isolation in the Time-Triggered Architecture
ISADS '03 Proceedings of the The Sixth International Symposium on Autonomous Decentralized Systems (ISADS'03)
DSN '04 Proceedings of the 2004 International Conference on Dependable Systems and Networks
Ringing out Fault Tolerance. A New Ring Network for Superior Low-Cost Dependability
DSN '05 Proceedings of the 2005 International Conference on Dependable Systems and Networks
Coverage and the Use of Cyclic Redundancy Codes in Ultra-Dependable Systems
DSN '05 Proceedings of the 2005 International Conference on Dependable Systems and Networks
Startup and Recovery of Fault-Tolerant Time-Triggered Communication
Startup and Recovery of Fault-Tolerant Time-Triggered Communication
Advancements in dependable time-triggered communication
SEUS'07 Proceedings of the 5th IFIP WG 10.2 international conference on Software technologies for embedded and ubiquitous systems
Hi-index | 0.00 |
The Time-Triggered Architecture, as architecture for safety-critical real-time applications, incorporates fault-tolerance mechanisms to ensure correct system operation despite failures. The primary fault hypothesis of the TTA claims to tolerate either the arbitrary failure of any one of its nodes or the passively arbitrary failure of any one of its communication channels. To cover these failure modes, active redundancy techniques are used, which basically means that nodes and channels are physically replicated. The primary fault hypothesis, is, however, not strong enough for certain applications that have to tolerate transient upsets of multiple, possibly all, components in the system. Such a transient upset of the system may break up the synchrony of the nodes and leave disjoined sets of nodes synchronized to each other while the overall synchronization is lost. Although the TTA provides a clique avoidance algorithm that is able to correct a wide class of such multiple transient failures, a stronger algorithm is needed for full coverage. In this paper we discuss a secondary fault hypothesis for the TTA that addresses the transient upset of multiple components and present a new clique resolving algorithm based on the TTA's integrated diagnosis and startup service.