Predicting protein complexes in protein interaction networks using a core-attachment algorithm based on graph communicability

Authors:
Xiaoke Ma;Lin Gao
Affiliations:
School of Computer Science and Technology, Xidian University, P.O. Box 171, No. 2 South TaiBai Road, Xi'an, Shaanxi 710071, PR China;School of Computer Science and Technology, Xidian University, P.O. Box 171, No. 2 South TaiBai Road, Xi'an, Shaanxi 710071, PR China
Venue:
Information Sciences: an International Journal
Year:
2012

Citing 21
Cited 1

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Enumerating all connected maximal common subgraphs in two graphs

Theoretical Computer Science
Algorithm 457: finding all cliques of an undirected graph

Communications of the ACM
Protein complex prediction via cost-based clustering

Bioinformatics
CFinder: locating cliques and overlapping modules in biological networks

Bioinformatics
Exploiting indirect neighbours and topological weight to predict protein function from protein--protein interactions

Bioinformatics
The worst-case time complexity for generating all maximal cliques and computational experiments

Theoretical Computer Science - Computing and combinatorics
Clustering by common friends finds locally significant proteins mediating modules

Bioinformatics
From pull-down data to protein interaction networks and complexes with biological relevance

Bioinformatics
Maximal fuzzy maps

Information Sciences: an International Journal
Maximum entropy membership functions for discrete fuzzy variables

Information Sciences: an International Journal
Identifying the topology of protein complexes from affinity purification assays

Bioinformatics
Protein complex prediction based on simultaneous protein interaction network

Bioinformatics
Integrating induction and deduction for noisy data mining

Information Sciences: an International Journal
Bootstrapping the interactome: unsupervised identification of protein complexes in yeast

RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
Incorporating multiple genomic features with the utilization of interacting domain patterns to improve the prediction of protein-protein interactions

Information Sciences: an International Journal
Fuzzy ε-subgroups

Information Sciences: an International Journal
Accelerating spectral clustering with partial supervision

Data Mining and Knowledge Discovery
Noise-robust algorithm for identifying functionally associated biclusters from gene expression data

Information Sciences: an International Journal
Identification of protein complexes from co-immunoprecipitation data

Bioinformatics
Greedy-type resistance of combinatorial problems

Discrete Optimization

Revealing network communities with a nonlinear programming method

Information Sciences: an International Journal

Quantified Score

Hi-index	0.07

Visualization

Abstract

Studying protein complexes is very important in biological processes because it helps reveal the structure-functionality relationships in a protein complex. Much attention has been paid to accurately predicting the protein complexes from the increasing amount of protein-protein interaction (PPI) data. Almost all of the current algorithms that concern the detection of protein complexes focus on discovering dense subgraphs based on the observation that dense subgraphs in a biological network may correspond to protein complexes. However, such an assumption would throw away further topological information about complexes. In this paper introducing the core-attachment concept, a novel core-attachment algorithm is developed by detecting the cores and attachments, respectively. To detect the cores of protein complexes, a virtual network is constructed using the eigenvalues and eigenvectors of the network involved, where each maximal clique corresponds to a protein complex core. With this notion, the problem of detecting protein complex cores is transformed into the classic all-cliques problem. The attachments are then merged into their cores to yield biologically meaningful complexes. A comprehensive comparison between MCL, DPClus, Coach, DECAFF and our algorithm has been made by comparing the predicted protein complexes with the benchmarked complexes. Experimental results indicate that our algorithm outperforms the MCL, DPClus and DECAFF and has comparative performance with the Coach algorithm in terms of the accuracy of prediction. Moreover, the detected complexes with core-attachment structures match well with the benchmark data, demonstrating that our algorithm can provide more insightful perspectives. Robustness analysis further shows that the algorithm is very robust.