Optimization of restricted searches in web directories using hybrid data structures

  • Authors:
  • Fidel Cacheda;Victor Carneiro;Carmen Guerrero;Angel Viña

  • Affiliations:
  • Department of Information and Communications Technologies, Facultad de Informática, A Coruña, Spain (all four authors)

  • Venue:
  • ECIR'03 Proceedings of the 25th European conference on IR research
  • Year:
  • 2003

Abstract

The need for efficient tools to manage, retrieve, and filter information on the WWW is clear. Web directories are taxonomies for classifying Web documents. This kind of information retrieval system supports a specific type of search in which the document collection is restricted to one area of the category graph. This paper introduces a data architecture for Web directories that improves the performance of such restricted searches. The architecture is based on a hybrid data structure composed of an inverted file with multiple embedded signature files. Two variants are presented: a hybrid architecture with total information and one with partial information. Both variants were implemented and compared against a basic model. The performance of restricted queries clearly improved, especially with the hybrid model with partial information, which performed well under every load of the search system.
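
To make the idea of an inverted file with embedded signatures concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not say how the signatures are built or where they are embedded, so the signature width, the hashing scheme (superimposed coding), and every name below (`HybridIndex`, `category_signature`, `restricted_search`) are assumptions made for illustration. Each posting carries a bitmask summarizing the categories a document belongs to, so a restricted query can discard most out-of-category documents with one bitwise test before an exact check removes any false positives.

```python
import hashlib
from collections import defaultdict

SIG_BITS = 64      # signature width in bits (assumed, not from the paper)
BITS_PER_CAT = 3   # bits set per category (assumed, not from the paper)


def category_signature(category: str) -> int:
    """Hash a category name to a bitmask with at most BITS_PER_CAT bits set."""
    digest = hashlib.sha256(category.encode("utf-8")).digest()
    sig = 0
    for i in range(BITS_PER_CAT):
        sig |= 1 << (digest[i] % SIG_BITS)
    return sig


class HybridIndex:
    """Hypothetical inverted file whose postings embed a category signature."""

    def __init__(self):
        # term -> list of (doc_id, superimposed category signature)
        self.postings = defaultdict(list)
        # doc_id -> exact category set, used to eliminate false positives
        self.doc_categories = {}

    def add_document(self, doc_id, terms, categories):
        """Index a document; `categories` lists its category and all ancestors."""
        sig = 0
        for category in categories:
            sig |= category_signature(category)
        self.doc_categories[doc_id] = set(categories)
        for term in set(terms):
            self.postings[term].append((doc_id, sig))

    def restricted_search(self, term, category):
        """Return ids of documents containing `term` within `category`."""
        query_sig = category_signature(category)
        hits = []
        for doc_id, sig in self.postings.get(term, []):
            if sig & query_sig != query_sig:
                continue  # signature mismatch: document is certainly outside
            if category in self.doc_categories[doc_id]:
                hits.append(doc_id)  # exact check passed
        return hits


index = HybridIndex()
index.add_document(1, ["jazz", "guitar"], ["Arts", "Arts/Music"])
index.add_document(2, ["jazz", "history"], ["Society", "Society/History"])
index.add_document(3, ["guitar"], ["Arts", "Arts/Music"])
print(index.restricted_search("jazz", "Arts"))
```

In this sketch the signature test only ever rules documents out; a match may still be a false positive, which is why the exact category set is consulted afterwards. How the paper's total-information and partial-information variants differ is not described in the abstract, so the sketch does not attempt to model that distinction.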