Scientific workflow systems are frequently used to execute long-running computational pipelines that are prone to premature termination due to network failures, server outages, and other faults. Researchers have presented approaches that provide fault tolerance for portions of specific workflows, but no existing solution handles faults that terminate the workflow engine itself while it executes a mix of stateless and stateful workflow components. Here we present a general framework for efficiently resuming workflow execution using the information that workflow systems commonly capture to record data provenance. Our approach enables fast workflow replay using only this commonly recorded provenance data. We also propose a checkpoint extension to standard provenance models that significantly reduces the computation needed to reset the workflow to a consistent state, resulting in much shorter re-execution times. Our work generalizes the rescue-DAG approach used by DAGMan to richer workflow models that may contain stateless and stateful multi-invocation actors as well as workflow loops.
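To make the rescue-DAG idea concrete, the following is a minimal sketch of the simplest case only: a workflow of stateless actors that each run once, where the record of completed actors is assumed to come from provenance. The function name `rescue_plan`, the dictionary layout, and the four-step pipeline are hypothetical and are not taken from DAGMan or from the framework described above. Stateful actors, multi-invocation actors, and loops, which the framework targets, are not covered by this sketch.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def rescue_plan(dag, completed):
    """Return the actors that still need to run, in dependency order.

    dag: maps each actor name to the set of actors it depends on.
    completed: actors recorded (here, by assumption, in provenance)
               as having finished successfully before the failure.
    """
    order = TopologicalSorter(dag).static_order()
    # Skip everything already recorded as done; keep the rest in order.
    return [actor for actor in order if actor not in completed]


# Hypothetical linear pipeline; the engine failed after "clean" finished.
dag = {
    "fetch": set(),
    "clean": {"fetch"},
    "analyze": {"clean"},
    "plot": {"analyze"},
}
completed = {"fetch", "clean"}

plan = rescue_plan(dag, completed)
print(plan)  # ['analyze', 'plot']
```

Skipping completed actors is sufficient here only because each actor is stateless and runs once. An actor that carries internal state across invocations would also need that state restored before resuming, which is the role of replay and checkpoints in the approach described above.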