A Weighted k-Nearest Neighbor Method for Gene Ontology Based Protein Function Prediction

Authors:
Saket Kharsikar;Dale Mugler;Daniel Sheffer;Francisco Moore;and Zhong-Hui Duan
Affiliations:
-;-;-;-;-
Venue:
IMSCCS '07 Proceedings of the Second International Multi-Symposiums on Computer and Computational Sciences
Year:
2007

Citing 0
Cited 4

3PRS: a personalized popular program recommendation system for digital TV for P2P social networks

Multimedia Tools and Applications
Identifying gene functions using functional expression profiles obtained by voxelation

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
CPRS: A cloud-based program recommendation system for digital TV platforms

Future Generation Computer Systems
A cloud-based intelligent TV program recommendation system

Computers and Electrical Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Numerous genome projects have produced a large and ever increasing amount of genomic sequence data. However, the biological functions of many proteins encoded by the sequences remain unknown. Protein function annotation and prediction become an essential and challenging task of post-genomic research. In this paper, we present an automated protein function prediction system based on a set of proteins of known biological functions. The functions of the proteins are characterized with gene ontology (GO) annotations. The prediction system uses a novel measure to calculate the pair-wise overall similarity between protein sequences. The protein function prediction is performed based on the GO annotations of similar sequences using a weighted k-nearest neighbor method. We show the prediction accuracies obtained using the model organism yeast (Sacchyromyces cerevisiae). The results indicate that the weighted k-nearest neighbor method significantly outperforms the regular k-nearest neighbor method for protein molecular function prediction.