The AppLeS parameter sweep template: user-level middleware for the grid
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
The MultiCluster Model to the Integrated Use of Multiple Workstation Clusters
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Metrics and Benchmarking for Parallel Job Scheduling
IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Multiple-Queue Backfilling Scheduling with Priorities and Reservations for Parallel Systems
JSSPP '02 Revised Papers from the 8th International Workshop on Job Scheduling Strategies for Parallel Processing
Performance Evaluation of an Agent-Based Resource Management Infrastructure for Grid Computing
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Optimizing Static Job Scheduling in a Network of Heterogeneous Computers
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
QoS guided min-min heuristic for grid task scheduling
Journal of Computer Science and Technology - Grid computing
Queueing Network-based Optimisation Techniques for Workload Allocation in Clusters of Computers
SCC '04 Proceedings of the 2004 IEEE International Conference on Services Computing
Pace--A Toolset for the Performance Prediction of Parallel and Distributed Systems
International Journal of High Performance Computing Applications
Limit Allocation: An Efficient Processor Management Scheme for Hypercubes
ICPP '94 Proceedings of the 1994 International Conference on Parallel Processing - Volume 02
Allocating Non-Real-Time and Soft Real-Time Jobs in Multiclusters
IEEE Transactions on Parallel and Distributed Systems
Towards achieving reliable and high-performance nanocomputing via dynamic redundancy allocation
ACM Journal on Emerging Technologies in Computing Systems (JETC)
Distributed resource scheduling in grid computing using fuzzy approach
ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
A framework for providing hard delay guarantees and user fairness in Grid computing
Future Generation Computer Systems
Resource co-allocation for large-scale distributed environments
Proceedings of the 18th ACM international symposium on High performance distributed computing
SAQA: A Self-Adaptive QoS-Aware Scheduling Algorithm for Real-Time Tasks on Heterogeneous Clusters
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
A neural network realization of scheduling in grid computing environment
SMO'05 Proceedings of the 5th WSEAS international conference on Simulation, modelling and optimization
Journal of Systems and Software
Using an enterprise grid for execution of MPI parallel applications: a case study
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
QoS and preemption aware scheduling in federated and virtualized Grid computing environments
Journal of Parallel and Distributed Computing
Enhancing security of real-time applications on grids through dynamic scheduling
JSSPP'05 Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing
Coordinated rescheduling of Bag-of-Tasks for executions on multiple resource providers
Concurrency and Computation: Practice & Experience
Failure-aware resource provisioning for hybrid Cloud infrastructure
Journal of Parallel and Distributed Computing
Double auction-inspired meta-scheduling of parallel applications on global grids
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
This paper addresses the dynamic scheduling of parallel jobs with QoS demands (soft-deadlines) in multi-clusters and grids. Three metrics (over-deadline, makespan and idle-time) are combined with variable weights to evaluate the scheduling performance. These three metrics are used to measure the extent of the jobs' QoS demand compliance, the resource throughput and the resource utilization. Two levels of performance optimisation are applied in the multicluster. At the multicluster level, a scheduler (which we call MUSCLE) allocates parallel jobs with high packing potential to the same cluster; it also takes the jobs' QoS requirements into account and employs a heuristic to allocate suitable workloads to each cluster to balance the overall system performance. At the single cluster level, an existing workload manager, called TITAN, utilizes a genetic algorithm to further improve the scheduling performance of the jobs previously allocated by MUSCLE. Extensive experimental studies are conducted to verify the effectiveness of the scheduling mechanism as well as the effect of the prediction accuracy on the scheduling performance. The results show that compared with traditional distributed workload allocation policies, the comprehensive scheduling performance of parallel jobs is significantly improved across the multicluster, and the presence of prediction errors does not dramatically weaken the performance advantage.