Optimizing availability and QoS of heterogeneous distributed system based on residual lifetime in uncertain environment

Authors:
Chuang Lin;Xin Jiang;Hao Yin;Yuanzhuo Wang;Yada Hu;Binbin Xiong
Affiliations:
Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China;Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China;Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China;Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China;Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China;Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China
Venue:
The Journal of Supercomputing
Year:
2009

Citing 16
Cited 0

Task Allocation for Maximizing Reliability of Distributed Computer Systems

IEEE Transactions on Computers
Approximation algorithms for NP-hard problems

Approximation algorithms for NP-hard problems
Safety and Reliability Driven Task Allocation in Distributed Systems

IEEE Transactions on Parallel and Distributed Systems
Distributed functions allocation for reliability and delay optimization

ACM '86 Proceedings of 1986 ACM Fall joint computer conference
A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems

Journal of Parallel and Distributed Computing
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing

IEEE Transactions on Parallel and Distributed Systems
Reliable Matching and Scheduling of Precedence-Constrained Tasks in Heterogeneous Distributed Computing

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Grid Computing

Grid Computing
Scheduling methods for efficient utilization of cluster computing environments

Scheduling methods for efficient utilization of cluster computing environments
Measuring the Robustness of a Resource Allocation

IEEE Transactions on Parallel and Distributed Systems
Toward a Realistic Task Scheduling Model

IEEE Transactions on Parallel and Distributed Systems
A Stochastic Approach to Measuring the Robustness of Resource Allocations in Distributed Systems

ICPP '06 Proceedings of the 2006 International Conference on Parallel Processing
An Online Scheduling Algorithm for Assigning Jobs in the Computational Grid

IEICE - Transactions on Information and Systems
Dynamic resource allocation heuristics that manage tradeoff between makespan and robustness

The Journal of Supercomputing
Task Allocation in Distributed Data Processing

Computer

Quantified Score

Hi-index	0.00

Visualization

Abstract

A notable requirement of heterogeneous parallel and distributed computing systems is to maximize their processing performance and agreed upon QoS. Lots of work in this field has been done to optimize the system performance by improving certain metrics such as reliability, robustness, security, and so on. However, most of them assume that systems are running without interruption all the time and seldom consider the system's intrinsic characteristics, such as failure rate, repair rate, and lifetime. In this paper, we study how to achieve high availability based on residual lifetime analysis for heterogeneous distributed computational systems with considering their essential features. First, we provide an availability model taking into account system's expected residual lifetime. Second, we propose an objective function about the model and develop a heuristic scheduling algorithm to maximize the availability with the makespan constraint. At last, we demonstrate these advantages through the extensive simulated experiments.