An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Fuzzy Measures on the Gene Ontology for Gene Product Similarity
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A new method to measure the semantic similarity of GO terms
Bioinformatics
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Beyond TFIDF weighting for text categorization in the vector space model
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Bioinformatics
A New Measure Based on Gene Ontology for Semantic Similarity of Genes
ICIE '10 Proceedings of the 2010 WASE International Conference on Information Engineering - Volume 01
DOPCA: A New Method for Calculating Ontology-Based Semantic Similarity
ICIS '11 Proceedings of the 2011 10th IEEE/ACIS International Conference on Computer and Information Science
Hi-index | 0.00 |
The Gene Ontology (GO) provides a controlled vocabulary of terms for describing genes from different data resources. In this paper, we proposed a novel method determining semantic similarity of genes based on GO. The key principle of our method relies on the introduction of Term Frequency (TF) and Inverse Document Frequency (IDF) to quantify the weights of different GO terms to the same gene. Different from previous leading methods, our method needs no parameters and computes the gene similarity directly rather than term similarity first. Experimental results of clustering genes in biological pathways from Saccharomyces Genome Database (SGD) have demonstrated that our method is quite competitive and outperforms leading method in certain cases.