Prediction of protein-protein interaction types using the decision templates based on multiple classier fusion

  • Authors:
  • Wei Chen;Shao-Wu Zhang;Yong-Mei Cheng;Quan Pan

  • Affiliations:
  • College of Automation, Northwestern Polytechnical University, 710072, Xi'an, China;College of Automation, Northwestern Polytechnical University, 710072, Xi'an, China;College of Automation, Northwestern Polytechnical University, 710072, Xi'an, China;College of Automation, Northwestern Polytechnical University, 710072, Xi'an, China

  • Venue:
  • Mathematical and Computer Modelling: An International Journal
  • Year:
  • 2010

Quantified Score

Hi-index 0.98

Visualization

Abstract

Protein-protein interactions (PPIs) play a key role in many cellular processes, such as the regulation of enzymes, signal transduction or mediating the adhesion of cells. Knowing about the multitude of PPIs that allow the cell to function can help the biological scientist understand the molecular machinery of the cell. Unfortunately, it is both time-consuming and expensive to do so solely based on experiments due to the nature of the problem whose complexity is obviously overwhelming, just like the fact that ''life is complicated''. Therefore, developing computational approaches for predicting PPIs, binding sites and interaction types would be of significant value in this regard. In this paper, we propose a novel method for predicting the types of PPIs based on decision templates. First, we introduce the concept of the tensor product to construct three kinds of feature vectors which are the amino acid composition tensor product, the residue multi-scale conservation energy tensor product and the secondary structure content tensor product, then the correlation-based feature selection method was also introduced to reduce the dimensionality of these feature vectors. So, the protein pair can be represented by our three new kinds of feature vectors and Zhu's six kinds of feature vectors. The nine kinds of feature vectors are further taken as the inputs of individual support vector machine classifier respectively, and the outputs of these classifiers are aggregated with decision templates. The overall success rate obtained by jackknife cross-validation was 90.95%, indicating our method is very promising for predicting PPI types, might become a useful vehicle for studying the network biology in the post-genomic era.