Two web-based approaches for noun sense disambiguation

Authors:
Paolo Rosso;Manuel Montes-y-Gómez;Davide Buscaldi;Aarón Pancardo-Rodríguez;Luis Villaseñor Pineda
Affiliations:
Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain;Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain;Dipartimento di Informatica e Scienze dell'Informazione (DISI), Università di Genova, Italy;Lab. de Tecnologías del Lenguaje, Instituto Nacional de Astrofísica, Optica y Electrónica, Mexico;Lab. de Tecnologías del Lenguaje, Instituto Nacional de Astrofísica, Optica y Electrónica, Mexico
Venue:
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Year:
2005



Abstract

Lexical ambiguity resolution seems to be stuck because of the knowledge acquisition bottleneck. It is therefore worthwhile to investigate the possibility of using the Web as a lexical resource. This paper explores two approaches that use Web counts collected through a search engine. The first approach counts the hits of each possible synonym of the noun to be disambiguated together with the nouns of the context. In the second approach, the disambiguation of a noun uses a modifying adjective as supporting evidence. Adjective-noun pairs yielded better precision than the baseline, although with low recall. A comprehensive set of weighting formulae for combining Web counts was investigated in order to give a complete picture of the available options and of which formulae work best. The comparison across different search engines was also useful: Web counts, and consequently disambiguation results, were almost identical. Moreover, the Web seems to be more effective than the WordNet Domains lexical resource when integrated rather than used stand-alone.
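
As a rough illustration of the first approach, the sketch below scores each candidate sense by how often its synonyms co-occur with the context nouns. The abstract does not give the weighting formulae, so the score used here (joint hits divided by the synonym's own hits, averaged over the synonyms of a sense) is an assumption chosen for readability, not the authors' formula. The function name `disambiguate_by_synonyms`, the `hit_count` callback, and the toy counts are likewise hypothetical; in practice `hit_count` would wrap a search engine query.

```python
from typing import Callable, Dict, List


def disambiguate_by_synonyms(
    senses: Dict[str, List[str]],
    context_nouns: List[str],
    hit_count: Callable[[str], int],
) -> str:
    """Return the sense whose synonyms co-occur most with the context nouns.

    Assumed score (illustrative only): for each synonym, the hits of the
    query "synonym + context nouns" divided by the hits of the synonym
    alone, averaged over all synonyms of the sense.
    """
    scores: Dict[str, float] = {}
    for sense, synonyms in senses.items():
        total = 0.0
        for synonym in synonyms:
            own_hits = hit_count(synonym)
            if own_hits == 0:
                continue  # a synonym with no hits contributes nothing
            joint_query = " ".join([synonym] + context_nouns)
            total += hit_count(joint_query) / own_hits
        scores[sense] = total / len(synonyms) if synonyms else 0.0
    # Pick the sense with the highest score.
    return max(scores, key=lambda sense: scores[sense])


# Toy counts standing in for search engine results (invented numbers).
FAKE_COUNTS = {
    "depository": 100,
    "depository money loan": 30,
    "shore": 500,
    "shore money loan": 5,
}


def fake_hit_count(query: str) -> int:
    """Look up a query in the toy table; unknown queries get zero hits."""
    return FAKE_COUNTS.get(query, 0)


if __name__ == "__main__":
    candidate_senses = {
        "financial_institution": ["depository"],
        "river_side": ["shore"],
    }
    chosen = disambiguate_by_synonyms(
        candidate_senses, ["money", "loan"], fake_hit_count
    )
    print(chosen)
```

The second approach could reuse the same skeleton by passing the modifying adjective in place of the context nouns. Passing the hit counter as a callback keeps the scoring logic independent of any particular search engine, which also makes it straightforward to compare engines as the abstract describes.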