Polynomial-time approximation algorithms for weighted LCS problem

  • Authors:
  • Marek Cygan;Marcin Kubica;Jakub Radoszewski;Wojciech Rytter;Tomasz Waleń

  • Affiliations:
  • Dept. of Mathematics, Computer Science and Mechanics, University of Warsaw, Warsaw, Poland;Dept. of Mathematics, Computer Science and Mechanics, University of Warsaw, Warsaw, Poland;Dept. of Mathematics, Computer Science and Mechanics, University of Warsaw, Warsaw, Poland;Dept. of Mathematics, Computer Science and Mechanics, University of Warsaw, Warsaw, Poland and Dept. of Math. and Informatics, Copernicus University, Toruń, Poland;Dept. of Mathematics, Computer Science and Mechanics, University of Warsaw, Warsaw, Poland

  • Venue:
  • CPM'11 Proceedings of the 22nd annual conference on Combinatorial pattern matching
  • Year:
  • 2011

Abstract

We deal with a variant of the well-known Longest Common Subsequence (LCS) problem for weighted sequences. A (biological) weighted sequence determines the probability for each symbol to occur at a given position of the sequence (such sequences are also called Position Weighted Matrices, PWM). Two versions of the problem were proposed by Amir et al. (2009 and 2010); they are called LCWS and LCWS2 (Longest Common Weighted Subsequence Problems 1 and 2). We solve an open problem, stated in the conclusions of the paper by Amir et al., concerning the tractability of a log-probability version of the LCWS2 problem for bounded alphabets, showing that it is NP-hard already for an alphabet of size 2. We also improve the (1/|Σ|)-approximation algorithm given by Amir et al. (where Σ is the alphabet): we show a polynomial-time approximation scheme (PTAS) for the LCWS2 problem using O(n^5) space. We also give a simpler (1/2)-approximation algorithm for the same problem using only O(n^2) space.
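
As an illustration of the weighted-sequence notion described in the abstract, the following minimal Python sketch represents a weighted sequence as one probability distribution per position. The example data `X`, the helper names, and the product rule used to score a string placed at chosen positions are assumptions made for illustration; the abstract itself does not spell out the formal definitions of LCWS or LCWS2.

```python
from math import prod

# Hypothetical weighted sequence over the alphabet {a, b}:
# each position maps every symbol to its probability of occurring there.
X = [
    {"a": 0.5, "b": 0.5},
    {"a": 1.0, "b": 0.0},
    {"a": 0.25, "b": 0.75},
]


def is_weighted_sequence(seq, tol=1e-9):
    """Check that the probabilities at every position sum to 1."""
    return all(abs(sum(col.values()) - 1.0) <= tol for col in seq)


def occurrence_probability(seq, positions, s):
    """Probability that string s is spelled at the given increasing positions.

    Assumption: positions are independent, so the probability is the
    product of the per-position symbol probabilities.
    """
    if len(positions) != len(s):
        raise ValueError("need exactly one position per symbol of s")
    if any(p >= q for p, q in zip(positions, positions[1:])):
        raise ValueError("positions must be strictly increasing")
    return prod(seq[i].get(c, 0.0) for i, c in zip(positions, s))


if __name__ == "__main__":
    print(is_weighted_sequence(X))
    print(occurrence_probability(X, [0, 2], "ab"))
```

Under this assumed model, a candidate common subsequence would be scored against each of the two weighted sequences in this way and compared with a probability threshold.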