Author Name Disambiguation for Citations Using Topic and Web Correlation
ECDL '08 Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries
IdentityRank: Named entity disambiguation in the news domain
Expert Systems with Applications: An International Journal
Disambiguating authors in citations on the web and authorship correlations
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
The huge volumes of biomedical texts available online drives the increasing need for automated techniques to analyze and extract knowledge from these repositories of information. Resolving the ambiguity in biological terms in these texts is an important step for developing efficient knowledge discovery techniques. In this paper, we present a new method for biomedical term disambiguation in biomedical texts. The method is based on machine learning and can be viewed as a word classification task. We evaluated the method on geneprotein name disambiguation using Medline abstracts from years 1999-2003 containing about 3000 to 6000 gene and protein names. The technique is effective in disambiguating gene and protein names, achieving impressive accuracy, precision, and recall, with accuracy approaching about 90%, and outperforming the recently published results on this problem. Our technique is also applicable for the general problem of named entity disambiguation.