Efficient construction of maximal and minimal representations of motifs of a string

  • Authors:
  • François Nicolas; Veli Mäkinen; Esko Ukkonen

  • Affiliations:
  • Department of Computer Science, P.O. Box 68 (Gustaf Hällströmin katu 2b), FIN-00014, University of Helsinki, Finland; Department of Computer Science, P.O. Box 68 (Gustaf Hällströmin katu 2b), FIN-00014, University of Helsinki, Finland; Department of Computer Science, P.O. Box 68 (Gustaf Hällströmin katu 2b), FIN-00014, University of Helsinki, Finland

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2009

Abstract

Two substrings of a given text string are called synchronous (occurrence-equivalent) if their sets of occurrence locations are translates of each other. Linear-time algorithms are given for the problems of finding a shortest and a longest substring that is synchronous with a given substring. We also introduce approximate variants of the motif discovery problem and give polynomial-time algorithms for finding longest and shortest substrings whose suitably translated occurrence location set contains or, respectively, is contained in a given set of locations. The FFT technique used here also leads to an O(n log n) algorithm for finding the maximum-content gapped motif that is synchronous with a given set of locations; the previously known algorithm for this problem runs in quadratic time.
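
To make the definition of synchronous substrings concrete, the following is a minimal brute-force sketch in Python. It is not the linear-time algorithm of the paper; it only checks the definition directly by comparing occurrence sets. The function names `occurrences` and `are_synchronous` are illustrative assumptions and do not come from the paper.

```python
def occurrences(text, pattern):
    """Return the sorted start positions of pattern in text (overlaps allowed)."""
    positions = []
    start = text.find(pattern)
    while start != -1:
        positions.append(start)
        # Advance by one so that overlapping occurrences are also found.
        start = text.find(pattern, start + 1)
    return positions


def are_synchronous(text, u, v):
    """Naive check of the definition: the occurrence sets of u and v
    are translates of each other (one is the other shifted by a constant).

    Hypothetical helper for illustration only; runs in quadratic time
    in the worst case, unlike the algorithms described in the abstract.
    """
    occ_u = occurrences(text, u)
    occ_v = occurrences(text, v)
    # Both must actually occur, and the same number of times.
    if not occ_u or len(occ_u) != len(occ_v):
        return False
    # Both lists are sorted, so a translate must match position by position.
    shift = occ_v[0] - occ_u[0]
    return all(b - a == shift for a, b in zip(occ_u, occ_v))
```

For example, in the text `abcabcab` the substring `abc` occurs at positions 0 and 3 and `bc` occurs at positions 1 and 4, so the two are synchronous with shift 1. The substring `ab` occurs at 0, 3 and 6, so it is not synchronous with `abc`.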