Modern heuristic techniques for combinatorial problems
Modern heuristic techniques for combinatorial problems
Scheduling algorithms for fault-tolerance in hard-real-time systems
Real-Time Systems - Special issue on responsive computer systems
Hardware-software co-synthesis of fault-tolerant real-time distributed embedded systems
EURO-DAC '95/EURO-VHDL '95 Proceedings of the conference on European design automation
Guest Editorial: A Review of Worst-Case Execution-TimeAnalysis
Real-Time Systems - Special issue on worst-case execution-time analysis
Tolerance to Multiple Transient Faults for Aperiodic Tasks in Hard Real-Time Systems
IEEE Transactions on Computers
Scheduling with bus access optimization for distributed embedded systems
IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special issue on the 11th international symposium on system-level synthesis and design (ISSS'98)
Fault-Tolerant Real-Time Systems: The Problem of Replica Determinism
Fault-Tolerant Real-Time Systems: The Problem of Replica Determinism
A Fault-Tolerant Scheduling Algorithm for Real-Time Periodic Tasks with Possible Software Faults
IEEE Transactions on Computers
A New Fault-Tolerant Technique for Improving the Schedulability in Multiprocessor Real-time Systems
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Systematic AUED Codes for Self-Checking Architectures
DFT '98 Proceedings of the 13th International Symposium on Defect and Fault-Tolerance in VLSI Systems
Modeling the Effect of Technology Trends on the Soft Error Rate of Combinational Logic
DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
On-line detection of logic errors due to crosstalk, delay, and transient faults
ITC '98 Proceedings of the 1998 IEEE International Test Conference
Design-For-Debug in Hardware/Software Co-Design
CODES '97 Proceedings of the 5th International Workshop on Hardware/Software Co-Design
Fault-Tolerant Real-Time Scheduling using Passive Replicas
PRFTS '97 Proceedings of the 1997 Pacific Rim International Symposium on Fault-Tolerant Systems
Analysis of checkpointing for schedulability of real-time systems
RTCSA '97 Proceedings of the 4th International Workshop on Real-Time Computing Systems and Applications
RTCSA '99 Proceedings of the Sixth International Conference on Real-Time Computing Systems and Applications
Worst Case Timing Requirement of Real-Time Tasks with Time Redundancy
RTCSA '99 Proceedings of the Sixth International Conference on Real-Time Computing Systems and Applications
The XBW Model for Dependable Real-Time Systems
ICPADS '98 Proceedings of the 1998 International Conference on Parallel and Distributed Systems
Roll-forward error recovery in embedded real-time systems
ICPADS '96 Proceedings of the 1996 International Conference on Parallel and Distributed Systems
The Interplay of Power Management and Fault Recovery in Real-Time Systems
IEEE Transactions on Computers
Proceedings of the conference on Design, automation and test in Europe - Volume 2
Compact thermal modeling for temperature-aware design
Proceedings of the 41st annual Design Automation Conference
Modeling and Simulation of Time Domain Faults in Digital Systems
IOLTS '04 Proceedings of the International On-Line Testing Symposium, 10th IEEE
Reliability-Aware Co-Synthesis for Embedded Systems
ASAP '04 Proceedings of the Application-Specific Systems, Architectures and Processors, 15th IEEE International Conference
Design Optimization of Time-and Cost-Constrained Fault-Tolerant Distributed Embedded Systems
Proceedings of the conference on Design, Automation and Test in Europe - Volume 2
Schedulability-driven frame packing for multicluster distributed embedded systems
ACM Transactions on Embedded Computing Systems (TECS)
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Two-Phase Distributed Observation Problems
ACSD '05 Proceedings of the Fifth International Conference on Application of Concurrency to System Design
Multiple Transient Faults in Logic: An Issue for Next Generation ICs
DFT '05 Proceedings of the 20th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems
Proceedings of the 2005 Asia and South Pacific Design Automation Conference
Proceedings of the conference on Design, automation and test in Europe: Proceedings
Mapping of Fault-Tolerant Applications with Transparency on Distributed Embedded Systems*
DSD '06 Proceedings of the 9th EUROMICRO Conference on Digital System Design
Radiation Effects on Embedded Systems
Radiation Effects on Embedded Systems
Fault-Tolerant Systems
Online task-scheduling for fault-tolerant low-energy real-time systems
Proceedings of the 2006 IEEE/ACM international conference on Computer-aided design
Using Process-Level Redundancy to Exploit Multiple Cores for Transient Fault Tolerance
DSN '07 Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
CODES+ISSS '07 Proceedings of the 5th IEEE/ACM international conference on Hardware/software codesign and system synthesis
Reliability-aware Co-synthesis for Embedded Systems
Journal of VLSI Signal Processing Systems
Implementing fault-tolerance in real-time programs by automatic program transformations
ACM Transactions on Embedded Computing Systems (TECS)
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
NP-complete scheduling problems
Journal of Computer and System Sciences
Trading off transient fault tolerance and power consumption in deep submicron (DSM) VLSI circuits
IEEE Transactions on Very Large Scale Integration (VLSI) Systems - Special section on the 2002 international symposium on low-power electronics and design (ISLPED)
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Energy efficient configuration for qos in reliable parallel servers
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
IEEE Spectrum
Transparent recovery from intermittent faults in time-triggered distributed systems
IEEE Transactions on Computers
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Fault-Tolerant Distributed Deployment of Embedded Control Software
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Hi-index | 0.00 |
In this article, we propose a strategy for the synthesis of fault-tolerant schedules and for the mapping of fault-tolerant applications. Our techniques handle transparency/performance trade-offs and use the fault-occurrence information to reduce the overhead due to fault tolerance. Processes and messages are statically scheduled, and we use process reexecution for recovering from multiple transient faults. We propose a fine-grained transparent recovery, where the property of transparency can be selectively applied to processes and messages. Transparency hides the recovery actions in a selected part of the application so that they do not affect the schedule of other processes and messages. While leading to longer schedules, transparent recovery has the advantage of both improved debuggability and less memory needed to store the fault-tolerant schedules.