A conceptual model for transcriptome high-throughput sequencing pipeline

  • Authors:
  • Ruben Cruz Huacarpuma;Ruben Cruz Huacarpuma;Maria Emilia Walter

  • Affiliations:
  • Department of Computer Science, University of Brasilia, Brazil;Department of Computer Science, University of Brasilia, Brazil;Department of Computer Science, University of Brasilia, Brazil

  • Venue:
  • BSB'11 Proceedings of the 6th Brazilian conference on Advances in bioinformatics and computational biology
  • Year:
  • 2011

Abstract

In recent years, high-throughput sequencers have been generating enormous volumes of data in hundreds of genome projects around the world. Besides being stored, the original data are transformed through multiple analyses that are carried out in a computational pipeline. This poses important problems for handling these highly complex data. In this context, a model to represent and organize these data, and to guarantee their accessibility, correctness and understandability, is essential to support the work of the biologists involved in a transcriptome project. Differences in data formats, terminologies, file structures and ontologies make data management very difficult. In this work, we propose a conceptual model for the different phases of a transcriptome high-throughput sequencing pipeline in order to represent and manage data.