Semantic metadata generation for large scientific workflows

  • Authors:
  • Jihie Kim;Yolanda Gil;Varun Ratnakar

  • Affiliations:
  • Information Sciences Institute, University of Southern California, Marina del Rey, CA, United States;Information Sciences Institute, University of Southern California, Marina del Rey, CA, United States;Information Sciences Institute, University of Southern California, Marina del Rey, CA, United States

  • Venue:
  • ISWC'06 Proceedings of the 5th international conference on The Semantic Web
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In recent years, workflows have been increasingly used in scientific applications. This paper presents novel metadata reasoning capabilities that we have developed to support the creation of large workflows. They include 1) use of semantic web technologies in handling metadata constraints on file collections and nested file collections, 2) propagation and validation of metadata constraints from inputs to outputs in a workflow component, and through the links among components in a workflow, and 3) sub-workflows that generate metadata needed for workflow creation. We show how we used these capabilities to support the creation of large executable workflows in an earthquake science application with more than 7,000 jobs, generating metadata for more than 100,000 new files.