3D inverted index with cache sharing for web search engines

  • Authors:
  • Esteban Feuerstein;Veronica Gil-Costa;Mauricio Marin;Gabriel Tolosa;Ricardo Baeza-Yates

  • Affiliations:
  • Universidad Nacional de Buenos Aires, Argentina;Universidad Nacional de San Luis, Argentina,Yahoo! Labs Santiago, Chile;Universidad de Santiago de, Chile,Yahoo! Labs Santiago, Chile;Universidad Nacional de Lujan, Argentina, Universidad Nacional de Buenos Aires, Argentina;Yahoo! Labs Santiago, Chile

  • Venue:
  • Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web search engines achieve efficient performance by partitioning and replicating the indexing data structure used to support query processing. Current practice simply partitions and replicates the text collection on the set of cluster processors and then constructs in each processor an index data structure. This paper proposes a different approach by constructing an index data structure that properly considers the fact that data is partitioned and replicated. This leads to a so-called 3D indexing strategy that outperforms current approaches. Performance is further boosted by introducing an application caching scheme devised to hold most frequently issued queries.