An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
A tutorial on support vector regression
Statistics and Computing
Taxonomy learning: factoring the structure of a taxonomy into a semantic classification decision
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Fast protein classification with multiple networks
Bioinformatics
Hierarchical multi-label prediction of gene function
Bioinformatics
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples
The Journal of Machine Learning Research
A new method to measure the semantic similarity of GO terms
Bioinformatics
Protein functional class prediction with a combined graph
Expert Systems with Applications: An International Journal
Predicting Novel Human Gene Ontology Annotations Using Semantic Analysis
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
On the Importance of Comprehensible Classification Models for Protein Function Prediction
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Molecular Function Prediction Using Neighborhood Features
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Using the Gene Ontology hierarchy when predicting gene function
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
IEEE Transactions on Information Technology in Biomedicine
On Position-Specific Scoring Matrix for Protein Function Prediction
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Prediction of Protein Functions with Gene Ontology and Interspecies Protein Homology Data
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
True Path Rule Hierarchical Ensembles for Genome-Wide Gene Function Prediction
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Transductive multi-label ensemble classification for protein function prediction
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Protein function prediction using weak-label learning
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Identification of DNA-Binding and Protein-Binding Proteins Using Enhanced Graph Wavelet Features
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Protein Function Prediction using Multi-label Ensemble Classification
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
The functional annotation of proteins is one of the most important tasks in the post-genomic era. Although many computational approaches have been developed in recent years to predict protein function, most of these traditional algorithms do not take interrelationships among functional terms into account, such as different GO terms usually coannotate with some common proteins. In this study, we propose a new functional similarity measure in the form of Jaccard coefficient to quantify these interrelationships and also develop a framework for incorporating GO term similarity into protein function prediction process. The experimental results of cross-validation on S. cerevisiae and Homo sapiens data sets demonstrate that our method is able to improve the performance of protein function prediction. In addition, we find that small size terms associated with a few of proteins obtain more benefit than the large size ones when considering functional interrelationships. We also compare our similarity measure with other two widely used measures, and results indicate that when incorporated into function prediction algorithms, our proposed measure is more effective. Experiment results also illustrate that our algorithms outperform two previous competing algorithms, which also take functional interrelationships into account, in prediction accuracy. Finally, we show that our method is robust to annotations in the database which are not complete at present. These results give new insights about the importance of functional interrelationships in protein function prediction.