Sparse LCS common substring alignment

Authors:
Gad M. Landau;Baruch Schieber;Michal Ziv-Ukelson
Affiliations:
Department of Computer Science, Haifa University, Haifa, Israel and Department of Computer and Information Science, Polytechnic University, Six MetroTech Center, Brooklyn, NY;IBM T.J. Watson Research Center, Yorktown Heights, NY;Department of Computer Science, Haifa University, Haifa, Israel and IBM T.J. Watson Research Center, Yorktown Heights, NY
Venue:
CPM'03 Proceedings of the 14th annual conference on Combinatorial pattern matching
Year:
2003

Citing 11
Cited 2

Sparse dynamic programming II: convex and concave cost functions

Journal of the ACM (JACM)
A space efficient algorithm for finding the best nonoverlapping alignment score

Theoretical Computer Science
An Algorithm for Locating Nonoverlapping Regions of Maximum Alignment Score

SIAM Journal on Computing
Algorithms on strings, trees, and sequences: computer science and computational biology

Algorithms on strings, trees, and sequences: computer science and computational biology
String editing and longest common subsequences

Handbook of formal languages, vol. 2
All Highest Scoring Paths in Weighted Grid Graphs and Their Application to Finding All Approximate Repeats in Strings

SIAM Journal on Computing
Algorithms for the Longest Common Subsequence Problem

Journal of the ACM (JACM)
On the shared substring alignment problem

SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
A fast algorithm for computing longest common subsequences

Communications of the ACM
On the common substring alignment problem

Journal of Algorithms
A sub-quadratic sequence alignment algorithm for unrestricted cost matrices

SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms

Two algorithms for LCS Consecutive Suffix Alignment

Journal of Computer and System Sciences
Ontology driven bee's foraging approach based self adaptive online recommendation system

Journal of Systems and Software

Quantified Score

Hi-index	0.00

Visualization

Abstract

The "Common Substring Alignment" problem is defined as follows. The input consists of a set of strings S1, S2 ... Sc, with a common substring appearing at least once in each of them, and a target string T. The goal is to compute similarity of all strings Si with T, without computing the part of the common substring over and over again. In this paper we consider the Common Substring Alignment problem for the LCS (Longest Common Subsequence) similarity metric. Our algorithm gains its efficiency by exploiting the sparsity inherent to the LCS problem. Let Y be the common substring, n be the size of the compared sequences, Ly be the length of the LCS of T and Y, denoted |LCS[T, Y]|, and L be max{|LCS[T,Si]|}. Our algorithm consists of an O(nLy) time encoding stage that is executed once per common substring, and an O(L) time alignment stage that is executed once for each appearance of the common substring in each source string. The additional running time depends only on the length of the parts of the strings that are not in any common substring.