Constant time approximation scheme for largest well predicted subset

  • Authors:
  • Bin Fu;Lusheng Wang

  • Affiliations:
  • Department of Computer Science, University of Texas-Pan American Edinburg, TX;Department of Computer Science, City University of Hong Kong, Hong Kong

  • Venue:
  • COCOON'10 Proceedings of the 16th annual international conference on Computing and combinatorics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The largest well predicted subset problem is formulated for comparison of two predicted 3D protein structures from the same sequence. Given two ordered point sets A = {a1, ..., an} and B = {b1, b2, ... bn} containing n points, and a threshold d, the largest well predicted subset problem is to find the rigid transformation T for a largest subset Bopt of B such that the distance between ai and T(bi) is at most d for every bi in Bopt. A meaningful prediction requires that the size of Bopt is at least an for some constant a [8]. We use LWPS(A,B, d, α) to denote the largest well predicted subset problem with meaningful prediction. An (1 + δ1, 1 - δ2)-approximation for LWPS(A,B, d, α) is to find a transformation T to bring a subset B′ ⊆ B of size at least (1-δ2)|Bopt| such that for each bi ∈ B′, the Euclidean distance between the two points distance(ai, T(bi)) ≤ (1 + δ1)d. We develop a constant time (1 + δ1, 1 - δ2)-approximation algorithm for LWPS(A,B, d, α) for arbitrary positive constants δ1 and δ2.