Bioinformatics
Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the mechanisms by which proteins adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detecting repeating structures from sequence analysis alone is considered an arduous task, since structure and function are often preserved even under considerable sequence divergence. In this paper we present PTRStalker, a new algorithm for ab initio detection of very fuzzy tandem repeats in protein amino acid sequences. The reported results show that, when run on amino acid sequences from the UniProtKB/Swiss-Prot database, PTRStalker detects novel tandemly repeated structures not captured by other state-of-the-art tools.