Provenance-based validation of e-science experiments

  • Authors:
  • Simon Miles;Sylvia C. Wong;Weijian Fang;Paul Groth;Klaus-Peter Zauner;Luc Moreau

  • Affiliations:
  • School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom;School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom;School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom;School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom;School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom;School of Electronics and Computer Science, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom

  • Venue:
  • Web Semantics: Science, Services and Agents on the World Wide Web
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

E-science experiments typically involve many distributed services maintained by different organisations. After an experiment has been executed, it is useful for a scientist to verify that the execution was performed correctly or is compatible with some existing experimental criteria or standards, not necessarily anticipated prior to execution. Scientists may also want to review and verify experiments performed by their colleagues. There are no existing frameworks for validating such experiments in today's e-science systems. Users therefore have to rely on error checking performed by the services, or adopt other ad hoc methods. This paper introduces a platform-independent framework for validating workflow executions. The validation relies on reasoning over the documented provenance of experiment results and semantic descriptions of services advertised in a registry. This validation process ensures experiments are performed correctly, and thus results generated are meaningful. The framework is tested in a bioinformatics application that performs protein compressibility analysis.