Using the web information structure for retrieving web pages

  • Authors:
  • Mirna Adriani;Rama Pandugita

  • Affiliations:
  • Faculty of Computer Science, University of Indonesia, Depok, Indonesia;Faculty of Computer Science, University of Indonesia, Depok, Indonesia

  • Venue:
  • CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a report on our participation in the mixed monolingual web task of the 2005 Cross-Language Evaluation Forum (CLEF). We compared the result of web page retrieval based on the page content, page title, and a combination of page content and page title. The result shows that using the combination of page title resulted in the best retrieval performance compared to using only page content or page title. Taking into account the number of links referring to a web page and the depth of the directory path in its URL did not result in any significant improvement to the retrieval performance.