A Polynominal Time Approximation Scheme for the Closest Substring Problem

  • Authors:
  • Bin Ma

  • Affiliations:
  • -

  • Venue:
  • COM '00 Proceedings of the 11th Annual Symposium on Combinatorial Pattern Matching
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we study the following problem: Given n strings s1, s2,..., sn, each of length m, find a substring ti of length L for each si, and a string s of length L, such that maxni=1 d(s, ti) is minimized, where d(.,.) is the Hamming distance. The problem was raised in [6] in an application of genetic drug target search and is a key open problem in many applications [7]. The authors of [6] showed that it is NP-hard and can be trivially approximated within ratio 2. A non-trivial approximation algorithm with ratio better than 2 was found in [7]. A major open question in this area is whether there exists a polynomial time approximation scheme (PTAS) for this problem. In this paper, we answer this question positively. We also apply our method to two related problems.