Managing data from high-throughput genomic processing: a case study

Authors:
Toby Bloom;Ted Sharpe
Affiliations:
Broad Institute of MIT and Harvard;Broad Institute of MIT and Harvard
Venue:
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Year:
2004

Citing 0
Cited 2

The integrated microbial genomes (IMG) system: a case study in biological data management

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Algorithm engineering: bridging the gap between algorithm theory and practice

Abstract

Genomic data has become the canonical example of very large, very complex data sets. As such, there has been significant interest in ways to provide targeted database support to address issues that arise in genomic processing. Whether genomic data is truly a special case, or just another application area exhibiting problems common to other domains, is an as yet unanswered question. In this abstract, we explore the structure and processing requirements of a large-scale genome sequencing center, as a case study of the issues that arise in genomic data management, and as a means to compare those issues with those that arise in other domains.