MIRACLE at WebCLEF 2005: combining web specific and linguistic information

  • Authors:
  • Ángel Martínez-González;José Luis Martínez-Fernández;César de Pablo-Sánchez;Julio Villena-Román

  • Affiliations:
  • Universidad Politécnica de Madrid;Universidad Carlos III de Madrid;Universidad Carlos III de Madrid;Universidad Carlos III de Madrid

  • Venue:
  • CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes MIRACLE approach to WebCLEF. A set of independent indexes was constructed for each top level domain of the EuroGOV collection. Each index contains information extracted from the document, like URL, title, keywords, detected named entities or HTML headers. These indexes are queried to obtain partial document rankings, which are combined with various relative weights to test the value of each index. The final aim is to identify which index (or combination of them) is more relevant for a retrieval task, avoiding the construction of a full-text index.