Identifying periodic occurrences of a template with applications to protein structure
Information Processing Letters
Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
Hi-index | 0.00 |
An approximate nested tandem repeat (NTR) in a string T is a complex repetitive structure consisting of many approximate copies of two substrings x and X ("motifs") interspersed with one another. NTRs have been found in real DNA sequences and are expected to have applications for evolutionary studies, both as a tool to understand concerted evolution, and as a potential marker in population studies. In this paper we describe software tools developed for database searches for NTRs. After a first program NTRF inder identifies putative NTR motifs, a confirmation step requires the application of the alignment of the putative NTR against exact NTRs built from the putative template motifs x andX. In this paper we describe an algorithm to solve this alignment problem in O(|T|(|x| + |X|)) space and time. Our alignment algorithm is based on Fischetti et al.'s wraparound dynamic programming.