IEEE Transactions on Parallel and Distributed Systems
Reliability-aware scheduling strategy for heterogeneous distributed computing systems
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
This paper introduces algorithms, which can produce both optimal, and suboptimal task assignments to minimize the probability of failure of an application executing on a heterogeneous distributed computing system. A cost function which defines this probability under a given task assignment is derived. To find optimal and suboptimal task assignments efficiently, the reliable matching and scheduling problem is converted to the state-space search problem in which the cost function derived is used to guide the search. The A* algorithm for finding optimal task assignments and the \math and hill-climbing algorithms for finding suboptimal task assignments are presented. The simulation results are provided to confirm the performance of the proposed algorithms.