New caching techniques for web search engines

  • Authors:
  • Mauricio Marin;Veronica Gil-Costa;Carlos Gomez-Pantoja

  • Affiliations:
  • Yahoo! Research Latin America, Santiago de Chile;Yahoo! Research Latin America, Santiago de Chile;Yahoo! Research Latin America, Santiago de Chile

  • Venue:
  • Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a cache hierarchy that enables Web search engines to efficiently process user queries. The different caches in the hierarchy are used to store pieces of data which are useful to solve frequent queries. Cached items range from specific data such as query answers to generic data such as segments of index retrieved from secondary memory. The paper also presents a comparative study based on discrete-event simulation and bulk-synchronous parallelism. The studied performance metrics include overall query throughput, single-user query latency and power consumption. In all cases, the results show that the proposed cache hierarchy leads to better performance than a baseline approach built on state of the art caching techniques.