Protein Complexes Discovery Based on Protein-Protein Interaction Data via a Regularized Sparse Generative Network Model

Authors:
Xiao-Fei Zhang;Dao-Qing Dai;Xiao-Xin Li
Affiliations:
Sun Yat-Sen University, Guangzhou;Sun Yat-Sen University, Guangzhou;Sun Yat-Sen University, Guangzhou
Venue:
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Year:
2012

Citing 23
Cited 2

Stochastic models for the Web graph

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Protein complex prediction via cost-based clustering

Bioinformatics
Modeling interactome: scale-free or geometric?

Bioinformatics
GO: :TermFinder---open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes

Bioinformatics
CFinder: locating cliques and overlapping modules in biological networks

Bioinformatics
Modular organization of protein interaction networks

Bioinformatics
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples

The Journal of Machine Learning Research
Exploring biological network structure using exponential random graph models

Bioinformatics
Fitting a geometric graph to a protein–protein interaction network

Bioinformatics
Protein complex identification by supervised graph local clustering

Bioinformatics
Ensemble non-negative matrix factorization methods for clustering protein–protein interactions

Bioinformatics
Mixed Membership Stochastic Blockmodels

The Journal of Machine Learning Research
Complex discovery from weighted PPI networks

Bioinformatics
How and when should interactome-derived clusters be used to predict functional modules and protein function?

Bioinformatics
SPICi

Bioinformatics
Modeling Protein Interacting Groups by Quasi-Bicliques: Complexity, Algorithm, and Application

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Semi-Supervised Learning

Semi-Supervised Learning
A Poisson model for random multigraphs

Bioinformatics
A Fast Hierarchical Clustering Algorithm for Functional Modules Discovery in Protein Interaction Networks

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A Max-Flow-Based Approach to the Identification of Protein Complexes Using Protein Interaction and Microarray Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Graph Regularized Nonnegative Matrix Factorization for Data Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Bioinformatics
A Framework for Incorporating Functional Interrelationships into Protein Function Prediction Algorithms

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Biomarker Identification and Cancer Classification Based on Microarray Data Using Laplace Naive Bayes Model with Mean Shrinkage

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Identifying Spurious Interactions and Predicting Missing Interactions in the Protein-Protein Interaction Networks via a Generative Network Model

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Detecting protein complexes from protein interaction networks is one major task in the postgenome era. Previous developed computational algorithms identifying complexes mainly focus on graph partition or dense region finding. Most of these traditional algorithms cannot discover overlapping complexes which really exist in the protein-protein interaction (PPI) networks. Even if some density-based methods have been developed to identify overlapping complexes, they are not able to discover complexes that include peripheral proteins. In this study, motivated by recent successful application of generative network model to describe the generation process of PPI networks and to detect communities from social networks, we develop a regularized sparse generative network model (RSGNM), by adding another process that generates propensities using exponential distribution and incorporating Laplacian regularizer into an existing generative network model, for protein complexes identification. By assuming that the propensities are generated using exponential distribution, the estimators of propensities will be sparse, which not only has good biological interpretation but also helps to control the overlapping rate among detected complexes. And the Laplacian regularizer will lead to the estimators of propensities more smooth on interaction networks. Experimental results on three yeast PPI networks show that RSGNM outperforms six previous competing algorithms in terms of the quality of detected complexes. In addition, RSGNM is able to detect overlapping complexes and complexes including peripheral proteins simultaneously. These results give new insights about the importance of generative network models in protein complexes identification.