Grid computing and workflow management systems emerged as solutions to the challenges of processing and storing the sheer volumes of data generated by modern simulations and data acquisition devices. Workflow management systems usually document workflow execution either as structured provenance information or as log files. Provenance is recognized as an important feature of workflow management systems; however, there are still few reports on its use in practical cases. In this paper we present the provenance system implemented in our platform, and then use the information it captured during 8 months of platform operation to analyze platform usage and to perform multilevel error pattern analysis. We exploit the large amount of structured data, using the explanatory potential of statistical approaches to find properties of workflows, jobs and resources that are related to workflow failure. Such an analysis enables us to characterize workflow executions on the infrastructure and to understand workflow failures. The approach is generic and can be applied to other e-infrastructures to gain insight into operational incidents.