Extracting structured motifs using a suffix tree—algorithms and application to promoter consensus identification

  • Authors:
  • Laurent Marsan;Marie-France Sagot

  • Affiliations:
  • Institut Gaspard Monge, Université de Marne la Vallée, 2, rue de la Butte Verte, 93160 - Noisy le Grand;Institut Gaspard Monge, Université de Marne la Vallée, 2, rue de la Butte Verte, 93160 - Noisy le Grand and Institut Pasteur, Service d'Informatique Scientifique, 28, rue du Dr. Roux, 75 ...

  • Venue:
  • RECOMB '00 Proceedings of the fourth annual international conference on Computational molecular biology
  • Year:
  • 2000

Abstract

This paper introduces two exact algorithms for extracting conserved structured motifs from a set of DNA sequences. Structured motifs are composed of p ≥ 2 parts separated by constrained spacers. These algorithms use a suffix tree to perform this task. They are efficient enough to extract site consensus sequences, such as promoter sequences, from a whole collection of non-coding sequences extracted from a genome. In particular, their time complexity scales linearly with N^2 n, where n is the average length of the sequences and N their number. An application with interesting results to the identification of promoter consensus sequences in bacterial genomes is shown.
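
To make the notion of a structured motif concrete, the following is a minimal sketch of what it means for a two-part motif (p = 2) to occur in a sequence. It is a brute-force scan, not the suffix-tree algorithms the paper introduces, and it only checks a given candidate motif rather than extracting unknown ones. The function names, the spacer bounds `dmin` and `dmax`, the use of Hamming distance to allow mismatches, and the example boxes are all assumptions made for illustration.

```python
from typing import List, Tuple


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_structured_motif(
    seq: str,
    box1: str,
    box2: str,
    dmin: int,
    dmax: int,
    max_mismatch: int = 0,
) -> List[Tuple[int, int]]:
    """Return (start, spacer_length) for every occurrence of box1,
    followed by a spacer of dmin..dmax letters, followed by box2.
    Each box may differ from the sequence in at most max_mismatch positions
    (an assumed error model, chosen here for simplicity)."""
    hits = []
    k1, k2 = len(box1), len(box2)
    for i in range(len(seq) - k1 + 1):
        if hamming(seq[i:i + k1], box1) > max_mismatch:
            continue
        for d in range(dmin, dmax + 1):
            j = i + k1 + d  # start of the second box
            if j + k2 > len(seq):
                break  # longer spacers would run past the end
            if hamming(seq[j:j + k2], box2) <= max_mismatch:
                hits.append((i, d))
    return hits


def quorum(seqs: List[str], box1: str, box2: str,
           dmin: int, dmax: int, max_mismatch: int = 0) -> int:
    """Count the sequences containing at least one occurrence of the motif."""
    return sum(
        1 for s in seqs
        if find_structured_motif(s, box1, box2, dmin, dmax, max_mismatch)
    )
```

For instance, `find_structured_motif("CCTTGACAAAAAATATAATGG", "TTGACA", "TATAAT", 4, 6)` reports a single occurrence starting at position 2 with a spacer of length 5. A scan of this kind must be repeated for every candidate motif, which is the cost that an index over the sequences, such as a suffix tree, is meant to avoid.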