A Two-Step Approach for Clustering Proteins Based on Protein Interaction Profile

  • Authors:
  • Pengjun Pei;Aidong Zhang

  • Affiliations:
  • State University of New York at Buffalo;State University of New York at Buffalo

  • Venue:
  • BIBE '05 Proceedings of the Fifth IEEE Symposium on Bioinformatics and Bioengineering
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

High-throughput methods for detecting protein-protein interactions (PPI) have given researchers an initial global picture of protein interactions on a genomic scale. The huge data sets generated by such experiments pose new challenges in data analysis. Though clustering methods have been successfully applied in many areas in bioinformatics, many clustering algorithms cannot be readily applied on protein interaction data sets. One main problem is that the similarity between two proteins cannot be easily defined. This paper proposes a probabilistic model to define the similarity based on conditional probabilities. We then propose a two-step method for estimating the similarity between two proteins based on protein interaction profile. In the first step, the model is trained with proteins with known annotation. Based on this model, similarities are calculated in the second step. Experiments show that our method improves performance.