A conceptual density-based approach for the disambiguation of toponyms

Authors:
Davide Buscaldi; Paulo Rosso
Affiliations:
Universidad Politécnica de Valencia, 46022 Valencia, Spain; Universidad Politécnica de Valencia, 46022 Valencia, Spain
Venue:
International Journal of Geographical Information Science
Year:
2008

Abstract

A large quantity of information is now stored in digital form. Much of it consists of unstructured text documents, in which geographical references are usually given as place names. A common problem in textual information retrieval is polysemy, that is, words that have more than one sense. The problem also arises in the geographical domain: a place name may refer to several different locations in the world. In this paper we investigate the use of our word sense disambiguation technique in the geographical domain, with the aim of resolving ambiguous place names. Our technique is based on WordNet conceptual density. Because no reference corpus tagged with WordNet senses was available, we carried out the experiments on a set of 1,210 place names extracted from the SemCor corpus, which we named GeoSemCor and made publicly available. We compared our method with the most-frequent-sense baseline and with the enhanced Lesk method, which had not previously been tested with large contexts. The results show that better precision can be achieved with a small context (phrase level), whereas greater coverage can be obtained with large contexts (document level). The proposed method should be tested on other corpora, because our experiments revealed an excessive bias towards the most frequent sense in GeoSemCor.
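The trade-off between precision and coverage mentioned in the abstract can be made concrete with a small sketch. The code below is not the authors' implementation. It assumes the definitions commonly used in word sense disambiguation evaluation (precision = correct answers / attempted answers, coverage = attempted answers / all instances), and it assumes that sense index 0 stands for the most frequent sense. The `Instance` type, the function names, and the toy data are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Instance:
    """One ambiguous place name (hypothetical structure, for illustration only)."""
    toponym: str
    gold_sense: int           # index of the correct sense
    predicted: Optional[int]  # None when the system gives no answer


def precision_and_coverage(instances: List[Instance]) -> Tuple[float, float]:
    """Return (precision, coverage) under the assumed definitions:
    precision = correct / attempted, coverage = attempted / total."""
    attempted = [i for i in instances if i.predicted is not None]
    correct = sum(1 for i in attempted if i.predicted == i.gold_sense)
    precision = correct / len(attempted) if attempted else 0.0
    coverage = len(attempted) / len(instances) if instances else 0.0
    return precision, coverage


def most_frequent_sense_baseline(instances: List[Instance]) -> List[Instance]:
    """Baseline that always answers with sense 0, assumed here to be
    the most frequent sense of every toponym."""
    return [Instance(i.toponym, i.gold_sense, 0) for i in instances]


# Toy data (invented): a system that abstains on one of four place names.
toy = [
    Instance("Cambridge", gold_sense=0, predicted=0),
    Instance("Cambridge", gold_sense=1, predicted=None),
    Instance("Georgia", gold_sense=1, predicted=1),
    Instance("Victoria", gold_sense=0, predicted=2),
]

system_precision, system_coverage = precision_and_coverage(toy)
baseline_precision, baseline_coverage = precision_and_coverage(
    most_frequent_sense_baseline(toy)
)
```

On this invented data the system answers three of four instances and gets two right, while the baseline answers all four and also gets two right. This shows how a method can gain precision by abstaining, at the cost of coverage.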