Protein function prediction using weak-label learning
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
CMSB'12 Proceedings of the 10th international conference on Computational Methods in Systems Biology
Hi-index | 3.84 |
Motivation: Multifunctional proteins perform several functions. They are expected to interact specifically with distinct sets of partners, simultaneously or not, depending on the function performed. Current graph clustering methods usually allow a protein to belong to only one cluster, therefore impeding a realistic assignment of multifunctional proteins to clusters. Results: Here, we present Overlapping Cluster Generator (OCG), a novel clustering method which decomposes a network into overlapping clusters and which is, therefore, capable of correct assignment of multifunctional proteins. The principle of OCG is to cover the graph with initial overlapping classes that are iteratively fused into a hierarchy according to an extension of Newman's modularity function. By applying OCG to a human protein–protein interaction network, we show that multifunctional proteins are revealed at the intersection of clusters and demonstrate that the method outperforms other existing methods on simulated graphs and PPI networks. Availability: This software can be downloaded from http://tagc.univ-mrs.fr/welcome/spip.php?rubrique197 Contact: brun@tagc.univ-mrs.fr Supplementary information:Supplementary data are available at Bioinformatics online.