An identity crisis in the life sciences

Authors:
Jun Zhao;Carole Goble;Robert Stevens
Affiliations:
School of Computer Science, University of Manchester, U.K.;School of Computer Science, University of Manchester, U.K.;School of Computer Science, University of Manchester, U.K.
Venue:
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Year:
2006

Citing 6
Cited 7

Exploring Williams--Beuren syndrome using myGrid

Bioinformatics
The virtual data grid: a new model and architecture for data-intensive collaboration

SSDBM '03 Proceedings of the 15th International Conference on Scientific and Statistical Database Management
Taverna: lessons in creating a workflow environment for the life sciences: Research Articles

Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
Named graphs

Web Semantics: Science, Services and Agents on the World Wide Web
A protocol for recording provenance in service-oriented grids

OPODIS'04 Proceedings of the 8th international conference on Principles of Distributed Systems
Harvesting RDF triples

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data

Recording the Context of Action for Process Documentation

Provenance and Annotation of Data and Processes
Provenance and the Price of Identity

Provenance and Annotation of Data and Processes
The Foundations for Provenance on the Web

Foundations and Trends in Web Science
Applying the virtual data provenance model

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Performance evaluation of the karma provenance framework for scientific workflows

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Principles of high quality documentation for provenance: a philosophical discussion

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Flexibility and utility of the cell cycle ontology

Applied Ontology - Is there Beauty in Ontologies?

Quantified Score

Hi-index	0.00

Visualization

Abstract

myGrid is an e-Science project assisting life scientists to build workflows that gather data from distributed, autonomous, replicated and heterogeneous resources. The provenance logs of workflow executions are recorded as RDF graphs. The log of one workflow run is used to trace the history of its execution process. However, by aggregating provenance logs of many workflow runs, one may gather the provenance of a common data product shared in multiple derivation paths. A successful aggregation relies on accurate and universal identification of each data product. The nature of bioinformatics data and services, however, makes this difficult. We describe the identity problem in bioinformatics data, and present a protocol for managing identity co-references and allocating identity to gathered and computed data products. The ability to overcome this problem means that the provenance of workflows in bioinformatics and other domains can be exploited to enhance the practice of e-Science.