Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
On bipartite and multipartite clique problems
Journal of Algorithms
Introduction to Algorithms
The Enhanced Suffix Array and Its Applications to Genome Analysis
WABI '02 Proceedings of the Second International Workshop on Algorithms in Bioinformatics
Fast and Sensitive Alignment of Large Genomic Sequences
CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
EMAGEN: an efficient approach to multiple whole genome alignment
APBC '04 Proceedings of the second conference on Asia-Pacific bioinformatics - Volume 29
Hi-index | 0.00 |
In this paper we present an algorithm to discover the optimal solution space of genome sequence alignment. The solution space is an LIS (longest increasing subsequence) graph, which can be constructed in O(n2) time, where n is the number of MUMs in given genome sequences. The LIS graph has many unique properties that lead to the development of efficient decomposition algorithms. With the availability of the solution space, users can examine all possible solutions and pick the one that best meet their needs. We provide the decomposition tree traversal algorithm, which outputs the solution in an elegant and compact clustered form. Moreover, the solution space can also be used to study the homology of genome sequences.