New caching techniques for web search engines

Authors:
Mauricio Marin;Veronica Gil-Costa;Carlos Gomez-Pantoja
Affiliations:
Yahoo! Research Latin America, Santiago de Chile;Yahoo! Research Latin America, Santiago de Chile;Yahoo! Research Latin America, Santiago de Chile
Venue:
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Year:
2010

Citing 20
Cited 14

A bridging model for parallel computation

Communications of the ACM
Semantic cache mechanism for heterogeneous Web querying

WWW '99 Proceedings of the eighth international conference on World Wide Web
Modern Information Retrieval

Modern Information Retrieval
Answering Queries by Semantic Caches

DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
Semantic caching of Web queries

The VLDB Journal — The International Journal on Very Large Data Bases
Predictive caching and prefetching of query results in search engines

WWW '03 Proceedings of the 12th international conference on World Wide Web
Three-level caching for efficient query processing in large Web search engines

WWW '05 Proceedings of the 14th international conference on World Wide Web
Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data

ACM Transactions on Information Systems (TOIS)
A pipelined architecture for distributed text query evaluation

Information Retrieval
High-performance distributed inverted files

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
The Case for Energy-Proportional Computing

Computer
Load-balancing and caching for collection selection architectures

Proceedings of the 2nd international conference on Scalable information systems
Design trade-offs for search engine caching

ACM Transactions on the Web (TWEB)
Inverted index compression and query processing with optimized document ordering

Proceedings of the 18th international conference on World wide web
Improved techniques for result caching in web search engines

Proceedings of the 18th international conference on World wide web
A Last-Resort Semantic Cache for Web Queries

SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Location cache for web queries

Proceedings of the 18th ACM conference on Information and knowledge management
Tuning the capacity of search engines: Load-driven routing and incremental caching to reduce and balance the load

ACM Transactions on Information Systems (TOIS)
Learning to distribute queries into web search nodes

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
On caching search engine query results

Computer Communications

Performance evaluation of improved web search algorithms

VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Timestamp-based result cache invalidation for web search engines

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
An evaluation of fault-tolerant query processing for web search engines

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Scalable search platform: improving pipelined query processing for distributed full-text retrieval

Proceedings of the 21st international conference companion on World Wide Web
Distributed search based on self-indexed compressed text

Information Processing and Management: an International Journal
A five-level static cache architecture for web search engines

Information Processing and Management: an International Journal
Prefetching query results and its impact on search engines

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Cache-Based Query Processing for Search Engines

ACM Transactions on the Web (TWEB)
A fault-tolerant cache service for web search engines: RADIC evaluation

Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Impact of regionalization on performance of web search engine result caches

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
A financial cost metric for result caching

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Second Chance: A Hybrid Approach for Dynamic Result Caching and Prefetching in Search Engines

ACM Transactions on the Web (TWEB)
Web search results caching service for structured P2P networks

Future Generation Computer Systems
Modelling Search Engines Performance Using Coloured Petri Nets

Fundamenta Informaticae - Application and Theory of Petri Nets and Concurrency, 2012

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a cache hierarchy that enables Web search engines to efficiently process user queries. The different caches in the hierarchy are used to store pieces of data which are useful to solve frequent queries. Cached items range from specific data such as query answers to generic data such as segments of index retrieved from secondary memory. The paper also presents a comparative study based on discrete-event simulation and bulk-synchronous parallelism. The studied performance metrics include overall query throughput, single-user query latency and power consumption. In all cases, the results show that the proposed cache hierarchy leads to better performance than a baseline approach built on state of the art caching techniques.