A parameterized algorithm for protein structure alignment

  • Authors:
  • Jinbo Xu;Feng Jiao;Bonnie Berger

  • Affiliations:
  • Department of Mathematics and Computer Science and AI Laboratory, MIT;School of Computer Science, University of Waterloo, Canada;Department of Mathematics and Computer Science and AI Laboratory, MIT

  • Venue:
  • RECOMB'06 Proceedings of the 10th annual international conference on Research in Computational Molecular Biology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a parameterized algorithm for aligning two protein structures, in the case where one protein structure is represented by a contact map graph and the other by a contact map graph or a distance matrix. If the sequential order of alignment is not required, the time complexity is polynomial in the protein size and exponential with respect to two parameters $\frac{D_u}{D_l}$ and $\frac{D_c}{D_l}$, which usually can be treated as constants. In particular, Du is the distance threshold determining if two residues are in contact or not, Dc is the maximally allowed distance between two matched residues after two proteins are superimposed, and Dl is the minimum inter-residue distance in a typical protein. This result indicates that if both $\frac{D_u}{D_l}$ and $\frac{D_c}{D_l}$ are small enough, then there is a polynomial-time approximation scheme for the non-sequential protein structure alignment problem. Empirically, both $\frac{D_u}{D_l}$ and $\frac{D_c}{D_l}$ are very small and can be treated as constants. This result clearly demonstrates that the hardness of the contact-map based protein structure alignment problem is related not to protein size but to several parameters, which depend on how the protein structure alignment problem is modeled. The result is achieved by decomposing the protein structure using tree decomposition and discretizing the rigid-body transformation space. We have implemented our algorithm and preliminary experimental results indicate that on a Linux PC, it takes from ten minutes to one hour to align two proteins with approximately 100 residues.