Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals
STOC '95 Proceedings of the twenty-seventh annual ACM symposium on Theory of computing
Spectrum Alignment: Efficient Resequencing by Hybridization
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
International Journal of Computer Applications in Technology
Brief Communication: Whole genome assembly from 454 sequencing output via modified DNA graph concept
Computational Biology and Chemistry
An efficient hybrid approach to correcting errors in short reads
MDAI'11 Proceedings of the 8th international conference on Modeling decisions for artificial intelligence
On the complexity of the Eulerian closed walk with precedence path constraints problem
Theoretical Computer Science
Lightweight algorithms for constructing and inverting the BWT of string collections
Theoretical Computer Science
Hi-index | 0.00 |
For the last twenty years fragment assembly in DNA sequencing followed the “overlap - layout - consensus” paradigm that is used in all currently available assembly tools. Although this approach proved to be useful in assembling clones, it faces difficulties in genomic shotgun assembly: the existing algorithms make assembly errors and are often unable to resolve repeats even in prokaryotic genomes. Biologists are well-aware of these errors and are forced to carry additional experiments to verify the assembled contigs.We abandon the classical “overlap - layout - consensus” approach in favor of a new Eulerian Superpath approach that, for the first time, resolves the problem of repeats in fragment assembly. Our main result is the reduction of the fragment assembly to a variation of the classical Eulerian path problem. This reduction opens new possibilities for repeat resolution and allows one to generate error-free solutions of the large-scale fragment assembly problems. The major improvement of EULER over other algorithms is that it resolves all repeats except long perfect repeats that are theoretically impossible to resolve without additional experiments.