Remote homology detection on alpha-structural proteins using simulated evolution

  • Authors:
  • Mengfei Cao; Lenore J. Cowen

  • Affiliations:
  • Tufts University, Medford, MA; Tufts University, Medford, MA

  • Venue:
  • Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
  • Year:
  • 2012

Abstract

Profile hidden Markov models are among the most widely used methods for recognizing protein sequences that are evolutionarily related, termed homologs. For cases where positive training data for these methods are sparse, Kumar and Cowen in 2009 introduced the paradigm of simulated evolution: a randomized algorithm that constructs additional artificial training sequences, generated from a highly simplistic model of how protein sequences evolve. These artificial sequences are then used together with the true positive training sequences to learn the profile hidden Markov model. Kumar and Cowen showed that augmenting the training set using a simple pointwise model of simulated evolution improved remote homolog detection by profile hidden Markov models. In 2010, they constructed a model of simulated evolution that captures the pairwise statistical preferences of residues that are hydrogen-bonded in beta-sheets, and showed that it improved the ability of hidden Markov models to recognize remote homologs for beta-structural motifs. In this work, we explore how best to extend the paradigm of simulated evolution to alpha-helical motifs. We find that simulated evolution can also improve the performance of profile hidden Markov models in detecting remote homologs of alpha-structural proteins.
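
As a rough illustration of the pointwise idea described in the abstract, the sketch below augments a small set of training sequences with randomly mutated copies. It is not the authors' implementation: the abstract does not specify the mutation model, so the uniform substitution over the 20 standard amino acids, the per-residue `mutation_rate`, and the function names `mutate_pointwise` and `augment_training_set` are all assumptions made for this example. A realistic model would more plausibly draw replacements from a substitution matrix, and the alpha-helical extension would additionally account for pairwise residue preferences.

```python
import random

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def mutate_pointwise(sequence, mutation_rate, rng):
    """Return a copy of `sequence` in which each residue is independently
    replaced, with probability `mutation_rate`, by a different amino acid.

    Assumption: replacements are drawn uniformly at random. Characters that
    are not standard residues (for example alignment gaps '-') are kept.
    """
    mutated = []
    for residue in sequence:
        if residue in AMINO_ACIDS and rng.random() < mutation_rate:
            choices = [aa for aa in AMINO_ACIDS if aa != residue]
            mutated.append(rng.choice(choices))
        else:
            mutated.append(residue)
    return "".join(mutated)


def augment_training_set(sequences, copies_per_sequence=3,
                         mutation_rate=0.1, seed=0):
    """Return the original sequences followed by artificial mutated copies.

    The originals come first, so the true positive examples are always
    retained alongside the simulated ones.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    augmented = list(sequences)
    for seq in sequences:
        for _ in range(copies_per_sequence):
            augmented.append(mutate_pointwise(seq, mutation_rate, rng))
    return augmented


if __name__ == "__main__":
    training = ["MKTAYIAKQR", "MKSAYLAKQR"]
    for seq in augment_training_set(training, copies_per_sequence=2):
        print(seq)
```

The augmented list would then be passed to whatever tool builds the profile hidden Markov model, in place of the original training set alone.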