Proceedings of the 4th on Middleware doctoral symposium
Logical reliability of interacting real-time tasks
Proceedings of the conference on Design, automation and test in Europe
Reliability versus performance for critical applications
Journal of Parallel and Distributed Computing
Analyzing scheduling with transient failures
Information Processing Letters
Supporting fault-tolerance for time-critical events in distributed environments
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
EUC'07 Proceedings of the 2007 international conference on Embedded and ubiquitous computing
Towards middleware for fault-tolerance in distributed real-time and embedded systems
DAIS'08 Proceedings of the 8th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Supporting fault-tolerance for time-critical events in distributed environments
Scientific Programming
A probabilistic approach for fault tolerant multiprocessor real-time scheduling
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Component deployment optimisation with bayesian learning
Proceedings of the 14th international ACM Sigsoft symposium on Component based software engineering
Architecture-based reliability evaluation under uncertainty
Proceedings of the joint ACM SIGSOFT conference -- QoSA and ACM SIGSOFT symposium -- ISARCS on Quality of software architectures -- QoSA and architecting critical systems -- ISARCS
Satisfaction-based query replication
Distributed and Parallel Databases
Architecture-Driven reliability and energy optimization for complex embedded systems
QoSA'10 Proceedings of the 6th international conference on Quality of Software Architectures: research into Practice - Reality and Gaps
A Multi-objective Approach for Workflow Scheduling in Heterogeneous Environments
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Architecture-driven reliability optimization with uncertain model parameters
Journal of Systems and Software
A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
Journal of Grid Computing
Reliability and performance optimization of pipelined real-time systems
Journal of Parallel and Distributed Computing
Multi-objective list scheduling of workflow applications in distributed computing infrastructures
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
Multi-criteria scheduling problems, involving optimizationof more than one criterion, are subject to a growinginterest. In this paper, we present a new bi-criteria schedulingheuristic for scheduling data-flow graphs of operationsonto parallel heterogeneous architectures according to twocriteria: first the minimization of the schedule length, andsecond the maximization of the system reliability. Reliabilityis defined as the probability that none of the system componentswill fail while processing. The proposed algorithmis a list scheduling heuristics, based on a bi-criteria compromisefunction that introduces priority between the operationsto be scheduled, and that chooses on what subsetof processors they should be scheduled. It uses the activereplication of operations to improve the reliability. If thesystem reliability or the schedule length requirements arenot met, then a parameter of the compromise function canbe changed and the algorithm re-executed. This process isiterated until both requirements are met.