Approximations and partial solutions for the consensus sequence problem

  • Authors:
  • Amihood Amir;Haim Paryenty;Liam Roditty

  • Affiliations:
  • Department of Computer Science, Bar Ilan University, Ramat Gan, Israel and Department of Computer Science, Johns Hopkins University, Baltimore, MD;Department of Computer Science, Bar Ilan University, Ramat Gan, Israel;Department of Computer Science, Bar Ilan University, Ramat Gan, Israel

  • Venue:
  • SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

The problem of finding the consensus of a given set of strings is formally defined as follows: given a set of strings S = {s1, . . . sk}, and a constant d, find, if it exists, a string s*, such that the Hamming distance of s* from each of the strings does not exceed d. In this paper we study an LP relaxation for the problem. We prove an additive upper bound, depending only in the number of strings k, and randomized bounds. We show that empirical results are much better. We also compare our program with some algorithms reported in the literature, and it is shown to perform well.