A conceptual model for transcriptome high-throughput sequencing pipeline

Authors:
Ruben Cruz Huacarpuma;Ruben Cruz Huacarpuma;Maria Emilia Walter
Affiliations:
Department of Computer Science, University of Brasilia, Brazil;Department of Computer Science, University of Brasilia, Brazil;Department of Computer Science, University of Brasilia, Brazil
Venue:
BSB'11 Proceedings of the 6th Brazilian conference on Advances in bioinformatics and computational biology
Year:
2011

Abstract

In recent years, high-throughput sequencers have been generating enormous volumes of data in hundreds of genome projects around the world. Besides being stored, the original data are transformed through multiple analyses carried out in a computational pipeline. This poses significant challenges for handling such highly complex data. In this context, a model that represents and organizes these data, and guarantees their accessibility, correctness and understandability, is essential to support the biologists involved in a transcriptome project. Differences in data formats, terminologies, file structures and ontologies make data management very difficult. In this work, we propose a conceptual model for the different phases of a transcriptome high-throughput sequencing pipeline in order to represent and manage its data.