Towards a generic infrastructure for sustainable management of quality controlled primary data

Authors:
Thomas Buchmann;Stefan Jablonski;Bernhard Volz;Bernhard Westfechtel
Affiliations:
Institute for Computer Science, University of Bayreuth, Bayreuth, Germany;Institute for Computer Science, University of Bayreuth, Bayreuth, Germany;Institute for Computer Science, University of Bayreuth, Bayreuth, Germany;Institute for Computer Science, University of Bayreuth, Bayreuth, Germany
Venue:
OTM'10 Proceedings of the 2010 international conference on On the move to meaningful internet systems
Year:
2010

Citing 7
Cited 0

Why and Where: A Characterization of Data Provenance

ICDT '01 Proceedings of the 8th International Conference on Database Theory
Biodiversity Informatics: The Challenge of Rapid Development, Large Databases, and Complex Data (Keynote)

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
MDA Explained: The Model Driven Architecture: Practice and Promise

MDA Explained: The Model Driven Architecture: Practice and Promise
Generic Model Management: Concepts And Algorithms (Lecture Notes in Computer Science)

Generic Model Management: Concepts And Algorithms (Lecture Notes in Computer Science)
A survey of data provenance in e-science

ACM SIGMOD Record
Model-Driven Software Development: Technology, Engineering, Management

Model-Driven Software Development: Technology, Engineering, Management
Scientific Data Management: Challenges, Technology, and Deployment

Scientific Data Management: Challenges, Technology, and Deployment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Collecting primary data in scientific research is currently being performed in numerous repositories. Frequently, these repositories have not been designed to support long-term evolution of data, processes, and tools. Furthermore, in many cases repositories have been set up for the specific needs of some research project, and are not maintained any longer when the project is terminated. Finally, quality control and data provenance issues are not addressed to a sufficient extent. Based on the experiences gained in a joint project with biologists in the domain of biodiversity informatics, we propose a generic infrastructure for sustainable management of quality controlled primary data. The infrastructure encompasses both project and institutional repositories and provides a process for migrating project data into institutional repositories. Evolution and adaptability are supported through a generic approach with respect to underlying data schemas, processes, and tools. Specific emphasis is placed on quality assurance and data provenance.