Sequence Based Prediction of Protein Mutant Stability and Discrimination of Thermophilic Proteins

  • Authors:
  • M. Michael Gromiha;Liang-Tsung Huang;Lien-Fu Lai

  • Affiliations:
  • Computational Biology Research Center (CBRC), National Institute of Advanced, Industrial Science and Technology (AIST), Tokyo, Japan 135-0064;Department of Computer Science and Information Engineering, MingDao University, Taiwan;Department of Computer Science and Information Engineering, National Changhua University of Education, Taiwan

  • Venue:
  • PRIB '08 Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Prediction of protein stability upon amino acid substitution and discrimination of thermophilic proteins from mesophilic ones are important problems in designing stable proteins. We have developed a classification rule generator using the information about wild-type, mutant, three neighboring residues and experimentally observed stability data. Utilizing the rules, we have developed a method based on decision tree for discriminating the stabilizing and destabilizing mutants and predicting protein stability changes upon single point mutations, which showed an accuracy of 82% and a correlation of 0.70, respectively. In addition, we have systematically analyzed the characteristic features of amino acid residues in 3075 mesophilic and 1609 thermophilic proteins belonging to 9 and 15 families, respectively, and developed methods for discriminating them. The method based on neural network could discrimi-nate them at the 5-fold cross-validation accuracy of 89% in a dataset of 4684 proteins and 91% in a test set of 707 proteins.