cWINNOWER Algorithm for Finding Fuzzy DNA Motifs

  • Authors:
  • Shoudan Liang

  • Affiliations:
  • -

  • Venue:
  • CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The c WINNOWER algorithm detects.fuzzy motifs in DNAsequences rich in protein-binding signals. A signal is definedas any short nucleotide pattern having up to d mutationsdiffering from a motif of length l. The algorithm findssuch motifs ifmultiple mutated copies of the motif (i.e., thesignals) are present in the DNA sequence in sufficient abundance.The cWINNOWER algorithm substantially improvesthe sensitivity of the winnower method of Pevzner and Szeby imposing a consensus constraint, enabling it to detectmuch weaker signals. We studied the minimum number ofdetectable motifs qc as a function of sequence length N forrandom sequences. We found that qc increases linearly withN for a fast version of the algorithm based on countingthree-member sub-cliques. Imposing consensus constraintsreduces qc by a factor of three in this case, which makes thealgorithm dramatically more sensitive. Our most sensitivealgorithm, which counts four-member sub-cliques, needs aminimum of only 13 signals to detect motifs in a sequenceof length N = 12000 for (l,d) = (15,4).