On the Structure of Small Motif Recognition Instances

  • Authors:
  • Christina Boucher;Daniel G. Brown;Stephane Durocher

  • Affiliations:
  • D. R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada N2L 3G1;D. R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada N2L 3G1;D. R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada N2L 3G1

  • Venue:
  • SPIRE '08 Proceedings of the 15th International Symposium on String Processing and Information Retrieval
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given a set of sequences, S , and degeneracy parameter, d , the Consensus Sequence problem asks whether there exists a sequence that has Hamming distance at most d from each sequence in S . A valid motif set is a set of sequences for which such a consensus sequence exists, while a decoy set is a set of sequences that does not have a consensus sequence but whose pairwise Hamming distances are all at most 2d . At present, no efficient solution is known to the Consensus Sequence problem when the number of sequences is greater than three. For instances of Consensus Sequence with binary sequences and cardinality four, we present a combinatorial characterization of decoy sets and a linear-time exact algorithm, resolving an open problem posed by Gramm et al. [7].