Workflow-based workloads usually consist of multiple instances of the same workflow, each a set of jobs with control or data dependencies that carries out a well-defined scientific computation task on its own input data. Running multiple instances simultaneously yields a high degree of concurrency and therefore higher throughput. However, deadlock is a potential problem when storage is constrained. To address this problem, we design and evaluate a deadlock detection-based scheduling (DDS) algorithm that can achieve high performance by making the best use of the available storage resources. Our algorithm takes advantage of the dataflow information of the workflow to speculatively schedule each instance if the currently available storage is sufficient for some constituent jobs, but not necessarily for the whole workflow instance. Whenever deadlock or a performance anomaly is detected, selected in-progress workflow instances are rolled back to release storage for other blocked jobs. We develop a suite of strategies to select the victims and beneficiaries (instances or jobs) and evaluate their performance via a simulation-based study. Our results show that the DDS algorithm can adapt the job concurrency to the available storage resources and achieve higher performance than some deadlock avoidance methods in our synthetic and real workflow computations.
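The interplay of speculative scheduling, deadlock detection, and rollback can be illustrated with a toy simulation. The sketch below is not the DDS algorithm evaluated in the paper; it is a minimal model under stated assumptions: each instance is a linear chain of jobs, every job's output stays in storage until its instance finishes, time advances in rounds, and all names (`Instance`, `simulate`, `demands`) are hypothetical. The victim and beneficiary rule used here (favor the instance holding the most storage, roll back those holding the least) is just one possible strategy chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    name: str
    demands: list       # storage needed by each job's output, in execution order
    next_job: int = 0   # index of the next job to run
    held: int = 0       # storage currently occupied by this instance


def simulate(instances, capacity):
    """Return (completion order, number of rollbacks) for a toy storage model."""
    for inst in instances:
        # Assumption: an instance keeps all its outputs until it finishes,
        # so it can only ever complete if its total footprint fits.
        if not inst.demands or sum(inst.demands) > capacity:
            raise ValueError(f"{inst.name} cannot run within {capacity} units")
    free, finished, rollbacks = capacity, [], 0
    active = list(instances)
    while active:
        # Most advanced instances get first pick of free storage (stable sort).
        active.sort(key=lambda i: -i.held)
        progressed = False
        for inst in active:
            need = inst.demands[inst.next_job]
            if need <= free:            # speculative: only the next job must fit
                free -= need
                inst.held += need
                inst.next_job += 1
                progressed = True
        for inst in [i for i in active if i.next_job == len(i.demands)]:
            free += inst.held           # a finished instance releases its storage
            finished.append(inst.name)
            active.remove(inst)
        if active and not progressed:
            # Deadlock detected: no blocked job fits in the free storage.
            beneficiary = active[0]
            need = beneficiary.demands[beneficiary.next_job]
            for victim in reversed(active[1:]):   # least storage held first
                if free >= need:
                    break
                if victim.held == 0:
                    continue
                free += victim.held     # roll back: discard the victim's outputs
                victim.held, victim.next_job = 0, 0
                rollbacks += 1
    return finished, rollbacks
```

With two instances of two jobs each, every job writing 2 units, and 4 units of storage, `simulate([Instance("A", [2, 2]), Instance("B", [2, 2])], 4)` starts both instances, detects the deadlock, rolls back `B`, and returns `(["A", "B"], 1)`. With 8 units the same workload completes with no rollback. A deadlock avoidance scheduler in this model would instead admit an instance only when its whole footprint fits, which is the conservative behavior the abstract contrasts DDS with.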