Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
WordNet: a lexical database for English
Communications of the ACM
SIGDOC '86 Proceedings of the 5th annual international conference on Systems documentation
Introduction to the special issue on word sense disambiguation: the state of the art
Computational Linguistics - Special issue on word sense disambiguation
Word-sense disambiguation using statistical models of Roget's categories trained on large corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Word sense disambiguation using sense examples automatically acquired from a second language
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Web query disambiguation using PageRank
Journal of the American Society for Information Science and Technology
Hi-index | 0.00 |
This paper presents a novel unsupervised methodology for automatic disambiguation of nouns found in unrestricted corpora. The proposed method is based on extending the context of a target word by querying the web, and then measuring the overlap of the extended context with the topic signatures of the different senses by using Bayes rule. The algorithm is evaluated on Semcor 2.0. The evaluation showed that the web-based extension of the target word's local context increases the amount of contextual information to perform semantic interpretation, in effect producing a disambiguation methodology, which achieves a result comparable to the performance of the best system in SENSEVAL 3.