Predicting protein function by frequent functional association pattern mining in protein interaction networks

Authors:
Young-Rae Cho;Aidong Zhang
Affiliations:
Department of Computer Science, Baylor University, Waco, TX;Department of Computer Science and Engineering, State University of New York, Buffalo, NY
Venue:
IEEE Transactions on Information Technology in Biomedicine
Year:
2010

Citing 9
Cited 2

Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
An Efficient Algorithm for Discovering Frequent Subgraphs

IEEE Transactions on Knowledge and Data Engineering
Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs

Bioinformatics
Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps

Bioinformatics
Efficient estimation of graphlet frequency distributions in protein--protein interaction networks

Bioinformatics
Exploiting indirect neighbours and topological weight to predict protein function from protein--protein interactions

Bioinformatics
Efficient Detection of Network Motifs

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Protein function prediction based on patterns in biological networks

RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology

A Framework for Incorporating Functional Interrelationships into Protein Function Prediction Algorithms

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Protein annotation from protein interaction networks and Gene Ontology

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Predicting protein function from protein interaction networks has been challenging because of the complexity of functional relationships among proteins. Most previous function prediction methods depend on the neighborhood of or the connected paths to known proteins. However, their accuracy has been limited due to the functional inconsistency of interacting proteins. In this paper, we propose a novel approach for function prediction by identifying frequent patterns of functional associations in a protein interaction network. A set of functions that a protein performs is assigned into the corresponding node as a label. A functional association pattern is then represented as a labeled subgraph. Our frequent labeled subgraph mining algorithm efficiently searches the functional association patterns that occur frequently in the network. It iteratively increases the size of frequent patterns by one node at a time by selective joining, and simplifies the network by a priori pruning. Using the yeast protein interaction network, our algorithm found more than 1400 frequent functional association patterns. The function prediction is performed by matching the subgraph, including the unknown protein, with the frequent patterns analogous to it. By leave-one-out cross validation, we show that our approach has better performance than previous link-based methods in terms of prediction accuracy. The frequent functional association patterns generated in this study might become the foundations of advanced analysis for functional behaviors of proteins in a system level.