Text compression
Compression and Coding Algorithms
Compression and Coding Algorithms
ACM Computing Surveys (CSUR)
A Simple Statistical Algorithm for Biological Sequence Compression
DCC '07 Proceedings of the 2007 Data Compression Conference
Human genomes as email attachments
Bioinformatics
The Sequence Alignment/Map format and SAMtools
Bioinformatics
Bioinformatics
Compression of DNA sequence reads in FASTQ format
Bioinformatics
Iterative Dictionary Construction for Compression of Large DNA Data Sets
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Compressing genomic sequence fragments using SLIMGENE
RECOMB'10 Proceedings of the 14th Annual international conference on Research in Computational Molecular Biology
Compression of whole genome alignments using a mixture of finite-context models
ICIAR'12 Proceedings of the 9th international conference on Image Analysis and Recognition - Volume Part I
Hi-index | 0.00 |
Genomic sequence data is being generated in massive quantities, and must be stored in compressed form. Here we examine the combined challenge of storing such data compactly, yet providing bioinformatics researchers with the ability to extract particular regions of interest without needing to fully decompress multi-gigabyte data collections. We focus on data produced in SAM format, which is particularly voluminous in nature, and describe storage techniques that have the desired blend of attributes.