Information storage and retrieval
Information storage and retrieval
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Modern Information Retrieval
Probabilistic question answering on the Web: Research Articles
Journal of the American Society for Information Science and Technology
A knowledge-driven approach to cluster validity assessment
Bioinformatics
A Vector Space Search Engine forWeb Services
ECOWS '05 Proceedings of the Third European Conference on Web Services
Measuring semantic similarity between Gene Ontology terms
Data & Knowledge Engineering
Hi-index | 0.00 |
We propose an approach for quantifying the biological relatedness between gene products, based on their properties, and measure their similarities using exclusively statistical NLP techniques and Gene Ontology (GO) annotations. We also present a novel similarity figure of merit, based on the vector space model, which assesses gene expression analysis results and scores gene product clusters' biological coherency, making sole use of their annotation terms and textual descriptions. We define query profiles which rapidly detect a gene product cluster's dominant biological properties. Experimental results validate our approach, and illustrate a strong correlation between our coherency score and gene expression patterns.