Dynamic Scheduling of Parallel Jobs with QoS Demands in Multiclusters and Grids

Authors:
Ligang He;Stephen A. Jarvis;Daniel P. Spooner;Xinuo Chen;Graham R. Nudd
Affiliations:
University of Warwick, UK;University of Warwick, UK;University of Warwick, UK;University of Warwick, UK;University of Warwick, UK
Venue:
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Year:
2004

Citing 10
Cited 16

The AppLeS parameter sweep template: user-level middleware for the grid

Proceedings of the 2000 ACM/IEEE conference on Supercomputing
The MultiCluster Model to the Integrated Use of Multiple Workstation Clusters

IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Metrics and Benchmarking for Parallel Job Scheduling

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Multiple-Queue Backfilling Scheduling with Priorities and Reservations for Parallel Systems

JSSPP '02 Revised Papers from the 8th International Workshop on Job Scheduling Strategies for Parallel Processing
Performance Evaluation of an Agent-Based Resource Management Infrastructure for Grid Computing

CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Optimizing Static Job Scheduling in a Network of Heterogeneous Computers

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
QoS guided min-min heuristic for grid task scheduling

Journal of Computer Science and Technology - Grid computing
Queueing Network-based Optimisation Techniques for Workload Allocation in Clusters of Computers

SCC '04 Proceedings of the 2004 IEEE International Conference on Services Computing
Pace--A Toolset for the Performance Prediction of Parallel and Distributed Systems

International Journal of High Performance Computing Applications
Limit Allocation: An Efficient Processor Management Scheme for Hypercubes

ICPP '94 Proceedings of the 1994 International Conference on Parallel Processing - Volume 02

Allocating Non-Real-Time and Soft Real-Time Jobs in Multiclusters

IEEE Transactions on Parallel and Distributed Systems
Towards achieving reliable and high-performance nanocomputing via dynamic redundancy allocation

ACM Journal on Emerging Technologies in Computing Systems (JETC)
Distributed resource scheduling in grid computing using fuzzy approach

ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
A framework for providing hard delay guarantees and user fairness in Grid computing

Future Generation Computer Systems
Resource co-allocation for large-scale distributed environments

Proceedings of the 18th ACM international symposium on High performance distributed computing
A comparison of utility-oriented algorithms for scheduling parallel tasks in multi-cluster grid

Cluster Computing
SAQA: A Self-Adaptive QoS-Aware Scheduling Algorithm for Real-Time Tasks on Heterogeneous Clusters

CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Performance evaluation of scheduling applications with DAG topologies on multiclusters with independent local schedulers

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
A neural network realization of scheduling in grid computing environment

SMO'05 Proceedings of the 5th WSEAS international conference on Simulation, modelling and optimization
Boosting adaptivity of fault-tolerant scheduling for real-time tasks with service requirements on clusters

Journal of Systems and Software
Using an enterprise grid for execution of MPI parallel applications: a case study

EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
QoS and preemption aware scheduling in federated and virtualized Grid computing environments

Journal of Parallel and Distributed Computing
Enhancing security of real-time applications on grids through dynamic scheduling

JSSPP'05 Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing
Coordinated rescheduling of Bag-of-Tasks for executions on multiple resource providers

Concurrency and Computation: Practice & Experience
Failure-aware resource provisioning for hybrid Cloud infrastructure

Journal of Parallel and Distributed Computing
Double auction-inspired meta-scheduling of parallel applications on global grids

Journal of Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the dynamic scheduling of parallel jobs with QoS demands (soft-deadlines) in multi-clusters and grids. Three metrics (over-deadline, makespan and idle-time) are combined with variable weights to evaluate the scheduling performance. These three metrics are used to measure the extent of the jobs' QoS demand compliance, the resource throughput and the resource utilization. Two levels of performance optimisation are applied in the multicluster. At the multicluster level, a scheduler (which we call MUSCLE) allocates parallel jobs with high packing potential to the same cluster; it also takes the jobs' QoS requirements into account and employs a heuristic to allocate suitable workloads to each cluster to balance the overall system performance. At the single cluster level, an existing workload manager, called TITAN, utilizes a genetic algorithm to further improve the scheduling performance of the jobs previously allocated by MUSCLE. Extensive experimental studies are conducted to verify the effectiveness of the scheduling mechanism as well as the effect of the prediction accuracy on the scheduling performance. The results show that compared with traditional distributed workload allocation policies, the comprehensive scheduling performance of parallel jobs is significantly improved across the multicluster, and the presence of prediction errors does not dramatically weaken the performance advantage.