The PUCRS NLP-group participation in CLEF2006: information retrieval based on linguistic resources

  • Authors:
  • Marco Gonzalez;Vera Lúcia Strube De Lima

  • Affiliations:
  • Grupo PLN, Faculdade de Informática, PUCRS, Alegre, Brazil;Grupo PLN, Faculdade de Informática, PUCRS, Alegre, Brazil

  • Venue:
  • CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents the 2006 participation of the PUCRS NLP-Group in the CLEF Monolingual Ad Hoc Task for Portuguese. We took part in this campaign using the TR+ Model, which is based on nominalization, binary lexical relations (BLR), Boolean queries, and the evidence concept. Our alternative strategy for lexical normalization, the nominalization, is to transform a word (adjective, verb, or adverb) into a semantically corresponding noun. BLRs identify relationships between nominalized terms and capture phrasal cohesion mechanisms, like those between subject and predicate, subject and object (direct or indirect), noun and adjective or verb and adverb. In our strategy, an index unit (a descriptor) may be a single term or a BLR, and we adopt the evidence concept: the descriptor weighting depends on the occurrence of phrasal cohesion mechanisms, besides depending on frequency of occurrence. We describe these features, which implement lexical normalization and term dependence in an information retrieval system based on linguistic resources.