Research article: A local average connectivity-based method for identifying essential proteins from the network level

  • Authors:
  • Min Li;Jianxin Wang;Xiang Chen;Huan Wang;Yi Pan

  • Affiliations:
  • School of Information Science and Engineering, Central South University, Changsha 410083, PR China and Department of Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA;School of Information Science and Engineering, Central South University, Changsha 410083, PR China;School of Information Science and Engineering, Central South University, Changsha 410083, PR China;School of Information Science and Engineering, Central South University, Changsha 410083, PR China;School of Information Science and Engineering, Central South University, Changsha 410083, PR China and Department of Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA

  • Venue:
  • Computational Biology and Chemistry
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Identifying essential proteins is very important for understanding the minimal requirements of cellular survival and development. Fast growth in the amount of available protein-protein interactions has produced unprecedented opportunities for detecting protein essentiality from the network level. Essential proteins have been found to be more abundant among those highly connected proteins. However, there exist a number of highly connected proteins which are not essential. By analyzing these proteins, we find that few of their neighbors interact with each other. Thus, we propose a new local method, named LAC, to determine a protein's essentiality by evaluating the relationship between a protein and its neighbors. The performance of LAC is validated based on the yeast protein interaction networks obtained from two different databases: DIP and BioGRID. The experimental results of the two networks show that the number of essential proteins predicted by LAC clearly exceeds that explored by Degree Centrality (DC). More over, LAC is also compared with other seven measures of protein centrality (Neighborhood Component (DMNC), Betweenness Centrality (BC), Closeness Centrality (CC), Bottle Neck (BN), Information Centrality (IC), Eigenvector Centrality (EC), and Subgraph Centrality (SC)) in identifying essential proteins. The comparison results based on the validations of sensitivity, specificity, F-measure, positive predictive value, negative predictive value, and accuracy consistently show that LAC outweighs these seven previous methods.