Caching query-biased snippets for efficient retrieval

Authors:
Diego Ceccarelli;Claudio Lucchese;Salvatore Orlando;Raffaele Perego;Fabrizio Silvestri
Affiliations:
Università di Pisa;ISTI-CNR, Pisa;Università Ca' Foscari, Venezia;ISTI-CNR, Pisa;ISTI-CNR, Pisa
Venue:
Proceedings of the 14th International Conference on Extending Database Technology
Year:
2011

Citing 13
Cited 3

New Methods in Automatic Extracting

Journal of the ACM (JACM)
Pruning long documents for distributed information retrieval

Proceedings of the eleventh international conference on Information and knowledge management
Introduction to the special issue on summarization

Computational Linguistics - Summarization
Predictive caching and prefetching of query results in search engines

WWW '03 Proceedings of the 12th international conference on World Wide Web
Web Search for a Planet: The Google Cluster Architecture

IEEE Micro
Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data

ACM Transactions on Information Systems (TOIS)
Fast generation of result snippets in web search

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Design trade-offs for search engine caching

ACM Transactions on the Web (TWEB)
Improved techniques for result caching in web search engines

Proceedings of the 18th international conference on World wide web
Document Compaction for Efficient Query Biased Snippet Generation

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
The automatic creation of literature abstracts

IBM Journal of Research and Development
Mining Query Logs: Turning Search Usage Data into Knowledge

Foundations and Trends in Information Retrieval
On caching search engine query results

Computer Communications

Towards efficient similar sentences extraction

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
The impact of solid state drive on search engine cache management

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Second Chance: A Hybrid Approach for Dynamic Result Caching and Prefetching in Search Engines

ACM Transactions on the Web (TWEB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Web Search Engines' result pages contain references to the top-k documents relevant for the query submitted by a user. Each document is represented by a title, a snippet and a URL. Snippets, i.e. short sentences showing the portions of the document being relevant to the query, help users to select the most interesting results. The snippet generation process is very expensive, since it may require to access a number of documents for each issued query. We assert that caching, a popular technique used to enhance performance at various levels of any computing systems, can be very effective in this context. We design and experiment several cache organizations, and we introduce the concept of supersnippet, that is the set of sentences in a document that are more likely to answer future queries. We show that supersnippets can be built by exploiting query logs, and that in our experiments a supersnippet cache answers up to 62% of the requests, remarkably outperforming other caching approaches.