Better than the real thing?: iterative pseudo-query processing using cluster-based language models

Authors:
Oren Kurland;Lillian Lee;Carmel Domshlak
Affiliations:
Cornell University, Ithaca NY and Carnegie Mellon University, Pittsburgh PA;Cornell University, Ithaca NY and Carnegie Mellon University, Pittsburgh PA;Technion, Haifa, Israel
Venue:
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2005

Citing 23
Cited 30

Incremental relevance feedback

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Incremental relevance feedback for information filtering

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Using probabilistic models of document retrieval without relevance information

Readings in information retrieval
Using interdocument similarity information in document retrieval systems

Readings in information retrieval
Improving automatic query expansion

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Cluster-based language models for distributed retrieval

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Model-based feedback in the language modeling approach to information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Optimal Mixture Models in IR

Proceedings of the 24th BCS-IRSG European Colloquium on IR Research: Advances in Information Retrieval
Building a filtering test collection for TREC 2002

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Error analysis of difficult TREC topics

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Language Modeling for Information Retrieval

Language Modeling for Information Retrieval
A survey on the use of relevance feedback for information access systems

The Knowledge Engineering Review
Cluster-based retrieval using language models

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Corpus structure, language models, and ad hoc information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A two-stage mixture model for pseudo feedback

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
The NRRC reliable information access (RIA) workshop

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Why current IR engines fail

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: structural re-ranking using links induced by language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance models for topic detection and tracking

HLT '02 Proceedings of the second international conference on Human Language Technology Research

PageRank without hyperlinks: structural re-ranking using links induced by language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Respect my authority!: HITS without hyperlinks, utilizing cluster-based language models

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Regularized estimation of mixture models for robust pseudo-relevance feedback

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Adaptive document clustering based on query-based similarity

Information Processing and Management: an International Journal
Inferential language models for information retrieval

ACM Transactions on Asian Language Information Processing (TALIP)
Parsimonious translation models for information retrieval

Information Processing and Management: an International Journal
Towards a unified approach to document similarity search using manifold-ranking of blocks

Information Processing and Management: an International Journal
Exploring social annotations for information retrieval

Proceedings of the 17th international conference on World Wide Web
A cluster-based resampling method for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A few examples go a long way: constructing query models from elaborate query formulations

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Parsimonious relevance models

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Adapting information retrieval to query contexts

Information Processing and Management: an International Journal
Clusters, language models, and ad hoc information retrieval

ACM Transactions on Information Systems (TOIS)
Using Contextual Information to Improve Search in Email Archives

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Re-ranking search results using language models of query-specific clusters

Information Retrieval
Cluster-based query expansion

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Concept-based feature generation and selection for information retrieval

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
An improved feedback approach using relevant local posts for blog feed retrieval

Proceedings of the 18th ACM conference on Information and knowledge management
A generative blog post retrieval model that uses query expansion based on external collections

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Utilizing passage-based language models for ad hoc document retrieval

Information Retrieval
Utilizing passage-based language models for document retrieval

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Conceptual language models for domain-specific retrieval

Information Processing and Management: an International Journal
PageRank without hyperlinks: Structural reranking using links induced by language models

ACM Transactions on Information Systems (TOIS)
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Utilizing local evidence for blog feed search

Information Retrieval
Improving retrieval of short texts through document expansion

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Exploiting External Collections for Query Expansion

ACM Transactions on the Web (TWEB)
Thesaurus-based feedback to support mixed search and browsing environments

ECDL'07 Proceedings of the 11th European conference on Research and Advanced Technology for Digital Libraries
A deterministic resampling method using overlapping document clusters for pseudo-relevance feedback

Information Processing and Management: an International Journal
Improving pseudo-relevance feedback via tweet selection

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a novel approach to pseudo-feedback-based ad hoc retrieval that uses language models induced from both documents and clusters. First, we treat the pseudo-feedback documents produced in response to the original query as a set of pseudo-query that themselves can serve as input to the retrieval process. Observing that the documents returned in response to the pseudo-query can then act as pseudo-query for subsequent rounds, we arrive at a formulation of pseudo-query-based retrieval as an iterative process. Experiments show that several concrete instantiations of this idea, when applied in conjunction with techniques designed to heighten precision, yield performance results rivaling those of a number of previously-proposed algorithms, including the standard language-modeling approach. The use of cluster-based language models is a key contributing factor to our algorithms' success.