Application of Text Summarization techniques to the Geographical Information Retrieval task

  • Authors:
  • José M. Perea-Ortega;Elena Lloret;L. Alfonso UreñA-LóPez;Manuel Palomar

  • Affiliations:
  • SINAI Research Group, Computer Science Department, University of Jaén, E-23071 Jaén, Spain;NLP Research Group, Department of Software and Computing Systems, University of Alicante, E-03080 Alicante, Spain;SINAI Research Group, Computer Science Department, University of Jaén, E-23071 Jaén, Spain;NLP Research Group, Department of Software and Computing Systems, University of Alicante, E-03080 Alicante, Spain

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2013

Quantified Score

Hi-index 12.05

Visualization

Abstract

Automatic Text Summarization has been shown to be useful for Natural Language Processing tasks such as Question Answering or Text Classification and other related fields of computer science such as Information Retrieval. Since Geographical Information Retrieval can be considered as an extension of the Information Retrieval field, the generation of summaries could be integrated into these systems by acting as an intermediate stage, with the purpose of reducing the document length. In this manner, the access time for information searching will be improved, while at the same time relevant documents will be also retrieved. Therefore, in this paper we propose the generation of two types of summaries (generic and geographical) applying several compression rates in order to evaluate their effectiveness in the Geographical Information Retrieval task. The evaluation has been carried out using GeoCLEF as evaluation framework and following an Information Retrieval perspective without considering the geo-reranking phase commonly used in these systems. Although single-document summarization has not performed well in general, the slight improvements obtained for some types of the proposed summaries, particularly for those based on geographical information, made us believe that the integration of Text Summarization with Geographical Information Retrieval may be beneficial, and consequently, the experimental set-up developed in this research work serves as a basis for further investigations in this field.