A consensus approach to predicting protein contact map via logistic regression

  • Authors: Jian-Yi Yang; Xin Chen

  • Affiliation (both authors): Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Nanyang Link, Singapore

  • Venue: ISBRA'11 Proceedings of the 7th international conference on Bioinformatics research and applications
  • Year: 2011

Abstract

Predicting protein contact maps is of great importance because it can facilitate and improve the prediction of protein 3D structure. However, the prediction accuracy is notoriously low. In this paper, a consensus contact map prediction method called LRcon is developed, which combines the prediction results of several complementary predictors using a logistic regression model. Tests on targets from the recent CASP9 experiment and on a large dataset, D856, consisting of 856 protein chains show that LRcon outperforms not only its component predictors but also simple averaging and voting schemes. For example, LRcon achieves 41.5% accuracy on the D856 dataset for the top L/10 long-range contact predictions, which is about 5% higher than its best-performing component predictor. The improvements made by LRcon are mainly attributed to the application of a consensus approach to complementary predictors and to the logistic regression analysis under the machine learning framework.
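
The consensus idea described in the abstract can be illustrated with a minimal sketch. It is not the LRcon implementation: the abstract does not specify the features, the component predictors, or the training procedure. The sketch assumes that each component predictor outputs one score in [0, 1] per residue pair, uses synthetic data, and fits a plain logistic regression by gradient descent. The function names (fit_logistic, predict_proba, top_k_accuracy) and the sequence length L are hypothetical.

```python
import numpy as np


def _with_intercept(X):
    # Prepend a column of ones so the first weight acts as the intercept.
    return np.hstack([np.ones((X.shape[0], 1)), X])


def fit_logistic(X, y, lr=0.5, n_iter=500):
    """Fit logistic regression weights by batch gradient descent on the log-loss."""
    Xb = _with_intercept(X)
    w = np.zeros(Xb.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Xb @ w)))
        w -= lr * (Xb.T @ (p - y)) / len(y)
    return w


def predict_proba(X, w):
    """Consensus contact probability for each residue pair."""
    return 1.0 / (1.0 + np.exp(-(_with_intercept(X) @ w)))


def top_k_accuracy(prob, y, k):
    """Fraction of true contacts among the k highest-scoring pairs."""
    top = np.argsort(-prob)[:k]
    return float(np.mean(y[top]))


# Synthetic stand-in data (an assumption, not the CASP9 or D856 data):
# 2000 residue pairs, about 10% true contacts, three noisy component predictors.
rng = np.random.default_rng(0)
n_pairs, n_predictors = 2000, 3
y = (rng.random(n_pairs) < 0.1).astype(float)
noise = rng.normal(0.0, 0.2, size=(n_pairs, n_predictors))
scores = np.clip(0.3 + 0.3 * y[:, None] + noise, 0.0, 1.0)

# Train on the first half, evaluate on the second half.
half = n_pairs // 2
w = fit_logistic(scores[:half], y[:half])
consensus = predict_proba(scores[half:], w)

L = 200        # hypothetical sequence length
k = L // 10    # "top L/10" predictions
acc_consensus = top_k_accuracy(consensus, y[half:], k)
acc_average = top_k_accuracy(scores[half:].mean(axis=1), y[half:], k)
print(f"top L/10 accuracy: consensus={acc_consensus:.2f}, average={acc_average:.2f}")
```

Here the simple average of the component scores serves as the baseline that the abstract compares against. On real data, the top L/10 evaluation would additionally be restricted to long-range residue pairs, a filter this sketch omits.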