As computational Grids are increasingly used for executing long-running, multi-phase parallel applications, it is important to develop efficient rescheduling frameworks that adapt application execution in response to resource and application dynamics. In this paper, three algorithms are developed for deciding when and where to reschedule parallel applications that execute on multi-cluster Grids. The algorithms derive rescheduling plans, each consisting of potential rescheduling points in the application's execution and a schedule of resources for the execution between every two consecutive rescheduling points. Using a large number of simulations, it is shown that the rescheduling plans developed by the algorithms can lead to a large decrease in application execution times when compared to executions without rescheduling on dynamic Grid resources. The rescheduling plans generated by the algorithms are also shown to be competitive with the near-optimal plans generated by brute-force methods. Of the three algorithms, the genetic algorithm yielded the most efficient rescheduling plans, with 9-12% smaller average execution times than the other algorithms.
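The structure of a rescheduling plan described above can be illustrated with a minimal Python sketch. It is not the paper's implementation. The names `ReschedulingPlan`, `plan_time`, and `brute_force_plan` are hypothetical, and the cost model is an assumption made only for illustration: the application is a sequence of phases, each phase has a predicted time on each cluster, the application runs on one cluster at a time, and every move between clusters costs a fixed `migration_cost`. The exhaustive search stands in for the brute-force baseline mentioned in the abstract and is practical only for tiny inputs.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class ReschedulingPlan:
    # points[i] is the phase index where segment i starts; points[0] is 0.
    points: tuple
    # schedules[i] is the cluster used from points[i] to the next point.
    schedules: tuple


def plan_time(plan, phase_time, migration_cost):
    """Total predicted time of a plan under the assumed cost model.

    phase_time[cluster][phase] is an assumed per-phase prediction.
    """
    n_phases = len(next(iter(phase_time.values())))
    bounds = list(plan.points) + [n_phases]
    total = 0.0
    for i, cluster in enumerate(plan.schedules):
        # Time spent executing the phases of this segment on its cluster.
        total += sum(phase_time[cluster][bounds[i]:bounds[i + 1]])
        # Assumed fixed cost whenever the cluster changes between segments.
        if i > 0 and cluster != plan.schedules[i - 1]:
            total += migration_cost
    return total


def brute_force_plan(phase_time, migration_cost):
    """Enumerate every per-phase cluster assignment and keep the fastest plan."""
    clusters = sorted(phase_time)
    n_phases = len(phase_time[clusters[0]])
    best = None
    for assignment in product(clusters, repeat=n_phases):
        # Collapse runs of identical clusters into rescheduling points.
        points, schedules = [0], [assignment[0]]
        for p in range(1, n_phases):
            if assignment[p] != assignment[p - 1]:
                points.append(p)
                schedules.append(assignment[p])
        plan = ReschedulingPlan(tuple(points), tuple(schedules))
        t = plan_time(plan, phase_time, migration_cost)
        if best is None or t < best[0]:
            best = (t, plan)
    return best


if __name__ == "__main__":
    # Hypothetical predictions for a three-phase application on two clusters.
    times = {"A": [10, 10, 30], "B": [20, 20, 10]}
    print(brute_force_plan(times, migration_cost=5))
```

With these made-up numbers, staying on either cluster takes 50 time units, while running the first two phases on cluster A and rescheduling to cluster B before the third takes 35. A heuristic such as a genetic algorithm would search the same plan space without enumerating every assignment.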