Optimizing the fault tolerance capabilities of distributed real-time systems
ETFA'09 Proceedings of the 14th IEEE international conference on Emerging technologies & factory automation
Scheduling for real-time mobile MapReduce systems
Proceedings of the 5th ACM international conference on Distributed event-based system
Robust partitioning for real-time multiprocessor systems with shared resources
Proceedings of the 2011 ACM Symposium on Research in Applied Computation
Efficient constraint handling during designing reliable automotive real-time systems
Ada-Europe'12 Proceedings of the 17th Ada-Europe international conference on Reliable Software Technologies
Realizing a fault-tolerant embedded controller on distributed real-time systems
ACM SIGBED Review - Special Issue on the 5th Workshop on Adaptive and Reconfigurable Embedded Systems
Hi-index | 0.01 |
Previous research which has considered task allocation and fault-tolerance together has concentrated on constructingschedules which accommodate a fixed number of redundanttasks. Often, all faults are treated as being equallysevere. There is little work which combines task allocationwith architectural level fault-tolerance issues such as thenumber of replicas to use and how they should be configured,both of which are tackled by this work. An acceptedmethod for assessing the impact of a combination of faults is to build a system utility model which can be used to assess how the system degrades when components fail. The keychallenge addressed here is how to design objective functions based on a utility model which can be incorporatedinto a search algorithm in order to optimise fault-toleranceproperties. Other issues such as how to extend the localsearch neighbourhood and balance objectives with schedulability constraints are also discussed.