The GeoTALP-IR system at GeoCLEF 2005: experiments using a QA-Based IR system, linguistic analysis, and a geographical thesaurus

  • Authors:
  • Daniel Ferrés;Alicia Ageno;Horacio Rodríguez

  • Affiliations:
  • TALP Research Center, Software Department, Universitat Politècnica de Catalunya, Barcelona, Spain;TALP Research Center, Software Department, Universitat Politècnica de Catalunya, Barcelona, Spain;TALP Research Center, Software Department, Universitat Politècnica de Catalunya, Barcelona, Spain

  • Venue:
  • CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes GeoTALP-IR system, a Geographical Information Retrieval (GIR) system. The system is described and evaluated in the context of our participation in the CLEF 2005 GeoCLEF Monolingual English task. The GIR system is based on Lucene and uses a modified version of the Passage Retrieval module of the TALP Question Answering (QA) system presented at CLEF 2004 and TREC 2004 QA evaluation tasks. We designed a Keyword Selection algorithm based on a Linguistic and Geographical Analysis of the topics. A Geographical Thesaurus (GT) has been built using a set of publicly available Geographical Gazetteers and a Geographical Ontology. Our experiments show that the use of a Geographical Thesaurus for Geographical Indexing and Retrieval has improved the performance of our GIR system.