Reverted indexing for feedback and expansion

Authors:
Jeremy Pickens;Matthew Cooper;Gene Golovchinsky
Affiliations:
FX Palo Alto Laboratory, Inc., Palo Alto, CA, USA;FX Palo Alto Laboratory, Inc., Palo Alto, CA, USA;FX Palo Alto Laboratory, Inc., Palo Alto, CA, USA
Venue:
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Year:
2010

Citing 16
Cited 6

Concept based query expansion

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Query-based sampling of text databases

ACM Transactions on Information Systems (TOIS)
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
The effectiveness of query expansion for distributed information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Probabilistic models of information retrieval based on measuring the divergence from randomness

ACM Transactions on Information Systems (TOIS)
Query expansion using associated queries

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Simplified similarity scoring using term ranks

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Query-driven document partitioning and collection selection

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Random walks on the click graph

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
On placing skips optimally in expectation

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Introduction to Information Retrieval

Introduction to Information Retrieval
Retrievability: an evaluation measure for higher order information access tasks

Proceedings of the 17th ACM conference on Information and knowledge management
Effective query expansion for federated search

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Tuning the capacity of search engines: Load-driven routing and incremental caching to reduce and balance the load

ACM Transactions on Information Systems (TOIS)
Term frequency normalisation tuning for BM25 and DFR models

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research

Reranking search results for sparse queries

Proceedings of the 20th ACM international conference on Information and knowledge management
The future is in the past: designing for exploratory search

Proceedings of the 4th Information Interaction in Context Symposium
Looking ahead: query preview in exploratory search

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Relating retrievability, performance and length

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
From keywords to keyqueries: content descriptors for the web

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Bridging memory-based collaborative filtering and text retrieval

Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional interactive information retrieval systems function by creating inverted lists, or term indexes. For every term in the vocabulary, a list is created that contains the documents in which that term occurs and its relative frequency within each document. Retrieval algorithms then use these term frequencies alongside other collection statistics to identify the matching documents for a query. In this paper, we turn the process around: instead of indexing documents, we index query result sets. First, queries are run through a chosen retrieval system. For each query, the resulting document IDs are treated as terms and the score or rank of the document is used as the frequency statistic. An index of documents retrieved by basis queries is created. We call this index a reverted index. With reverted indexes, standard retrieval algorithms can retrieve the matching queries (as results) for a set of documents (used as queries). These recovered queries can then be used to identify additional documents, or to aid the user in query formulation, selection, and feedback.