Composing Lineage Metadata with XML for Custom Satellite-Derived Data Products

  • Authors:
  • Rajendra Bose;James Frew

  • Affiliations:
  • University of California, Santa Barbara;University of California, Santa Barbara

  • Venue:
  • SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

As peer-to-peer dissemination of custom data productsevolves among Earth science research groups,investigators and data managers must consider how tocompose appropriate metadata for their researchcomputing activities. Because workflows may spanmultiple groups, it is critical that lineage (provenance)metadata also be assembled to document and preservethe origins and processing history of constituent dataproducts and transformations for future data consumers.To demonstrate methods for composing lineage metadatafor custom processing, we introduce our terminology forworkflow and employ a case study for the creation ofsatellite-derived ocean color data products. Our examplecontributes to a general metadata model for workflowthat incorporates lineage. We then discuss metadatarequirements for remote sensing-related data products.We propose two techniques for composing lineagemetadata, both based on accessory XML metadatadocuments that are paired with the data products andversioned data transformations they describe. The firsttechnique, implemented as a prototype, features adedicated lineage server that introduces the indirectionand flexibility necessary for Web-based lineagenavigation. The second, more promising techniqueinvolves defining a simple Resource DescriptionFramework (RDF) vocabulary for lineage metadata, andusing extant RDF/XML tools for query and navigation.These methods provide guidelines for composing lineagemetadata that are applicable to other domains.