Database transaction models for advanced applications
Database transaction models for advanced applications
An overview of workflow management: from process modeling to workflow automation infrastructure
Distributed and Parallel Databases - Special issue on software support for work flow management
Journal of Parallel and Distributed Computing
Experiences with predicting resource performance on-line in computational grid settings
ACM SIGMETRICS Performance Evaluation Review
Specifying and Monitoring Guarantees in Commercial Grids through SLA
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
GridWorkflow: A Flexible Failure Handling Framework for the Grid
HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
QoS-Aware Middleware for Web Services Composition
IEEE Transactions on Software Engineering
Adaptive grid job scheduling with genetic algorithms
Future Generation Computer Systems
A taxonomy of scientific workflow systems for grid computing
ACM SIGMOD Record
Making the Grid Predictable through Reservations and Performance Modelling
The Computer Journal
QoS Support for Time-Critical Grid Workflow Applications
E-SCIENCE '05 Proceedings of the First International Conference on e-Science and Grid Computing
The virtual resource manager: an architecture for SLA-aware resource management
CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
SBAC-PAD '05 Proceedings of the 17th International Symposium on Computer Architecture on High Performance Computing
Software—Practice & Experience
Network-based resource allocation for Grid Computing within an SLA context
GCC '06 Proceedings of the Fifth International Conference on Grid and Cooperative Computing
Error recovery mechanism for grid-based workflow within SLA context
International Journal of High Performance Computing and Networking
Taxonomy of grid business models
GECON'07 Proceedings of the 4th international conference on Grid economics and business models
Transparent fault tolerance for grid applications
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
Mapping workflows onto grid resources within an SLA context
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Hi-index | 0.00 |
Supporting SLAs (Service Level Agreements) for Grid-based workflows requires providing mechanisms for handling errors (i.e., the failures of subjobs). In the context of this paper, we propose an error recovery mechanism which can handle one failed subjob of a workflow. The error recovery mechanism has a maximum of three phases, depending on the impact of the error. In each phase, we use a dedicated algorithm to remap the subjobs of the workflow to the resources. The main contributions of the paper are the error recovery mechanism for SLA-based workflows and the mapping algorithm G-map, which is used in the first phase of the recovery mechanism. The G-map remaps the groups of subjobs, which are directly affected by an error. The efficiency of the proposed algorithm is validated through simulation results.