Fault-tolerant scheduling is an important issue for computational grid systems, because grids typically consist of strongly varying and geographically distributed resources. Most fault-tolerant scheduling systems select a resource to execute a given job based on response time and fault index. This paper presents a scheduling system that selects resources using a new factor called the scheduling indicator, which comprises the response time and the failure rate of grid resources. Whenever the grid scheduler has jobs to schedule on grid resources, it uses the scheduling indicator to make its scheduling decisions. The system's main strategy is to select the resources that have the lowest tendency to fail. Extensive simulation experiments were conducted to quantify the performance of the proposed system. The experiments show that the proposed system can considerably improve grid performance in terms of throughput, unavailability, turnaround time, and fail tendency.
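The abstract does not say how the scheduling indicator combines its two components, so the following is only a minimal sketch under stated assumptions, not the authors' formula. It assumes the failure rate is the fraction of submitted jobs that failed, that response time is normalized by the largest value among the candidate resources, and that the two are combined as a weighted sum. The names `Resource`, `scheduling_indicator`, `select_resource`, and the `weight` parameter are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    response_time: float  # expected response time for the job, in seconds
    failures: int         # number of jobs that failed on this resource
    jobs_submitted: int   # number of jobs submitted to this resource

    @property
    def failure_rate(self) -> float:
        # Assumed definition: fraction of submitted jobs that failed.
        if self.jobs_submitted == 0:
            return 0.0
        return self.failures / self.jobs_submitted


def scheduling_indicator(res: Resource, max_response_time: float,
                         weight: float = 0.5) -> float:
    """Hypothetical indicator: weighted sum of normalized response time
    and failure rate. Lower values mean a more attractive resource."""
    if max_response_time > 0:
        norm_rt = res.response_time / max_response_time
    else:
        norm_rt = 0.0
    return weight * norm_rt + (1.0 - weight) * res.failure_rate


def select_resource(resources: list[Resource], weight: float = 0.5) -> Resource:
    """Return the resource with the lowest scheduling indicator."""
    if not resources:
        raise ValueError("no resources available")
    max_rt = max(r.response_time for r in resources)
    return min(resources, key=lambda r: scheduling_indicator(r, max_rt, weight))


if __name__ == "__main__":
    candidates = [
        Resource("a", response_time=10.0, failures=5, jobs_submitted=10),
        Resource("b", response_time=20.0, failures=0, jobs_submitted=10),
        Resource("c", response_time=15.0, failures=1, jobs_submitted=10),
    ]
    print(select_resource(candidates).name)
```

With equal weights, the sketch prefers resource `c`, which is neither the fastest nor the most reliable but has the best combined score. Setting `weight` to 0 reduces the choice to the lowest failure rate alone, and setting it to 1 reduces it to the lowest response time alone.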