Practical NLP-Based Text Indexing

  • Authors:
  • Jesús Vilares Ferro;Francisco-Mario Barcala;Miguel A. Alonso;Jorge Graña Gil;Manuel Vilares Ferro

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider a set of natural language processing techniques based on finite-state technology that can be used to analyze huge amounts of texts. These techniques include an advanced tokenizer, a part-of-speech tagger that can manage ambiguous streams of words, a system for conflating words by means of derivational mechanisms, and a shallow parser to extract syntactic-dependency pairs. We propose to use these techniques in order to improve the performance of standard indexing engines.