Regularized estimation of mixture models for robust pseudo-relevance feedback

Authors:
Tao Tao;ChengXiang Zhai
Affiliations:
University of Illinois at Urbana-Champaign, IL;University of Illinois at Urbana-Champaign, IL
Venue:
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2006

Citing 12
Cited 65

Query expansion using local and global document analysis

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Model-based feedback in the language modeling approach to information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Term-specific smoothing for the language modeling approach to information retrieval: the importance of a query term

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop "bf_numdocs" experiment suite

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
The NRRC reliable information access (RIA) workshop

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Better than the real thing?: iterative pseudo-query processing using cluster-based language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A generative theory of relevance

A generative theory of relevance
Flexible pseudo-relevance feedback via selective sampling

ACM Transactions on Asian Language Information Processing (TALIP)

Topic sentiment mixture: modeling facets and opinions in weblogs

Proceedings of the 16th international conference on World Wide Web
Estimation and use of uncertainty in pseudo-relevance feedback

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Towards robust query expansion: model selection in the language modeling framework

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Query expansion using probabilistic local feedback with application to multimedia retrieval

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A new robust relevance model in the language model framework

Information Processing and Management: an International Journal
Opinion integration through semi-supervised topic modeling

Proceedings of the 17th international conference on World Wide Web
Learning to rank relational objects and its application to web search

Proceedings of the 17th international conference on World Wide Web
A cluster-based resampling method for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Selecting good expansion terms for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A few examples go a long way: constructing query models from elaborate query formulations

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A new probabilistic retrieval model based on the dirichlet compound multinomial distribution

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Improving Mobile Web-IR Using Access Concentration Sites in Search Results

WISE '08 Proceedings of the 9th international conference on Web Information Systems Engineering
Active relevance feedback for difficult queries

Proceedings of the 17th ACM conference on Information and knowledge management
Adaptive subjective triggers for opinionated document retrieval

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Statistical Language Models for Information Retrieval A Critical Review

Foundations and Trends in Information Retrieval
Crossing textual and visual content in different application scenarios

Multimedia Tools and Applications
Access concentration detection in click logs to improve mobile Web-IR

Information Sciences: an International Journal
Semi-supervised document retrieval

Information Processing and Management: an International Journal
Query Expansion Using External Evidence

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Using Contextual Information to Improve Search in Email Archives

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Query performance prediction for information retrieval based on covering topic score

Journal of Computer Science and Technology
Query dependent pseudo-relevance feedback based on wikipedia

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
A User Profiles Acquiring Approach Using Pseudo-Relevance Feedback

RSKT '09 Proceedings of the 4th International Conference on Rough Sets and Knowledge Technology
"A term is known by the company it keeps": On Selecting a Good Expansion Set in Pseudo-Relevance Feedback

ICTIR '09 Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory
Learning to Rank for Information Retrieval

Foundations and Trends in Information Retrieval
Context-based online medical terminology navigation

Expert Systems with Applications: An International Journal
Adaptive relevance feedback in information retrieval

Proceedings of the 18th ACM conference on Information and knowledge management
Reducing the risk of query expansion via robust constrained optimization

Proceedings of the 18th ACM conference on Information and knowledge management
A comparative study of methods for estimating query language models with pseudo feedback

Proceedings of the 18th ACM conference on Information and knowledge management
A generative blog post retrieval model that uses query expansion based on external collections

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Pseudo relevance feedback with incremental learning for high level feature detection

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Improving probabilistic information retrieval by modeling burstiness of words

Information Processing and Management: an International Journal
A statistical view of binned retrieval models

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Enhancing relevance models with adaptive passage retrieval

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Conceptual language models for domain-specific retrieval

Information Processing and Management: an International Journal
Multilingual PRF: english lends a helping hand

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: Structural reranking using links induced by language models

ACM Transactions on Information Systems (TOIS)
Multilingual pseudo-relevance feedback: performance study of assisting languages

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
A unified optimization framework for robust pseudo-relevance feedback algorithms

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Medical query generation by term-category correlation

Information Processing and Management: an International Journal
LambdaMerge: merging the results of query reformulations

Proceedings of the fourth ACM international conference on Web search and data mining
A boosting approach to improving pseudo-relevance feedback

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Social annotation in query expansion: a machine learning approach

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Query modeling for entity search based on terms, categories, and examples

ACM Transactions on Information Systems (TOIS)
Promoting divergent terms in the estimation of relevance models

ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
From "identical" to "similar": fusing retrieved lists based on inter-document similarities

Journal of Artificial Intelligence Research
The opposite of smoothing: a language model approach to ranking query-specific document clusters

Journal of Artificial Intelligence Research
Improving retrieval accuracy of difficult queries through generalizing negative document language models

Proceedings of the 20th ACM international conference on Information and knowledge management
Selecting related terms in query-logs using two-stage SimRank

Proceedings of the 20th ACM international conference on Information and knowledge management
Detecting levels of interest from spoken dialog with multistream prediction feedback and similarity based hierarchical fusion learning

SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Exploiting real-time information retrieval in the microblogosphere

Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Adaptive query suggestion for difficult queries

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Proximity-based rocchio's model for pseudo relevance

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Exploiting External Collections for Query Expansion

ACM Transactions on the Web (TWEB)
Relevance Feedback Fusion via Query Expansion

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
An incremental approach to efficient pseudo-relevance feedback

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A deterministic resampling method using overlapping document clusters for pseudo-relevance feedback

Information Processing and Management: an International Journal
Unsupervised latent concept modeling to identify query facets

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
A Theoretical Analysis of Pseudo-Relevance Feedback Models

Proceedings of the 2013 Conference on the Theory of Information Retrieval
A novel neighborhood based document smoothing model for information retrieval

Information Retrieval
A learning approach to optimizing exploration---exploitation tradeoff in relevance feedback

Information Retrieval
Collaborative pseudo-relevance feedback

Expert Systems with Applications: An International Journal
Bias-variance analysis in estimating true query model for information retrieval

Information Processing and Management: an International Journal
Hybrid pseudo-relevance feedback for microblog retrieval

Journal of Information Science
Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk Over Acoustic Similarity Graphs

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.01

Visualization

Abstract

Pseudo-relevance feedback has proven to be an effective strategy for improving retrieval accuracy in all retrieval models. However the performance of existing pseudo feedback methods is often affected significantly by some parameters, such as the number of feedback documents to use and the relative weight of original query terms; these parameters generally have to be set by trial-and-error without any guidance. In this paper, we present a more robust method for pseudo feedback based on statistical language models. Our main idea is to integrate the original query with feedback documents in a single probabilistic mixture model and regularize the estimation of the language model parameters in the model so that the information in the feedback documents can be gradually added to the original query. Unlike most existing feedback methods, our new method has no parameter to tune. Experiment results on two representative data sets show that the new method is significantly more robust than a state-of-the-art baseline language modeling approach for feedback with comparable or better retrieval accuracy.