Partial digest is hard to solve for erroneous input data

  • Authors:
  • Mark Cieliebak;Stephan Eidenbenz;Paolo Penna

  • Affiliations:
  • Institute of Theoretical Computer Science, ETH Zurich, Zurich, Switzerland;Discrete Simulation Seiences, Los Alamos National Laboratory;Dipartimento di Informatica ed Applicazioni "Renato M. Capocelli" Università di Salerno, Italy

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2005

Quantified Score

Hi-index 5.23

Visualization

Abstract

The Partial Digest problem asks for the coordinates of m points on a line such that the pairwise distances of the points form a given multiset of (m 2) distances. Partial Digest is a well-studied problem with important applications in physical mapping of DNA molecules. Its computational complexity status is open. Input data for Partial Digest from real-life experiments are always prone to error, which suggests to study variations of Partial Digest that take this fact into account. In this paper, we study the computational complexity of Partial Digest variants that model three different error types that can occur in the data: additional distances, missing distances, and erroneous fragment lengths. We show that these variations are NP-hard, hard to approximate, and strongly NP-hard, respectively.