Query dependent pseudo-relevance feedback based on wikipedia

Authors:
Yang Xu;Gareth J.F. Jones;Bin Wang
Affiliations:
Institute of Computing,Chinese Academy of Sciences, Beijing, China;Dublin City University, Dublin, Ireland;Institute of Computing,Chinese Academy of Sciences, Beijing, China
Venue:
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Year:
2009

Citing 22
Cited 43

Improving two-stage ad-hoc retrieval for short queries

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Model-based feedback in the language modeling approach to information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Clustering Algorithms

Clustering Algorithms
Query Expansion by Mining User Logs

IEEE Transactions on Knowledge and Data Engineering
Simple BM25 extension to multiple weighted fields

Proceedings of the thirteenth ACM international conference on Information and knowledge management
A framework for selective query expansion

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Concept-based interactive query expansion

Proceedings of the 14th ACM international conference on Information and knowledge management
Regularized estimation of mixture models for robust pseudo-relevance feedback

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Combining fields for query expansion and adaptive query expansion

Information Processing and Management: an International Journal
Personalized query expansion for the web

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Latent concept expansion using markov random fields

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Improving weak ad-hoc queries using wikipedia asexternal corpus

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A knowledge-based search engine powered by wikipedia

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Fact Discovery in Wikipedia

WI '07 Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence
A cluster-based resampling method for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Selecting good expansion terms for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval and feedback models for blog feed search

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Ambiguous queries: test collections need more sense

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

Exploiting time-based synonyms in searching document archives

Proceedings of the 10th annual joint conference on Digital libraries
Multilingual PRF: english lends a helping hand

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Effective query expansion with the resistance distance based term similarity metric

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Supervised query modeling using wikipedia

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multilingual pseudo-relevance feedback: performance study of assisting languages

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Preliminary study into query translation for patent retrieval

PaIR '10 Proceedings of the 3rd international workshop on Patent information retrieval
Query expansion based on clustered results

Proceedings of the VLDB Endowment
Social annotation in query expansion: a machine learning approach

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Enriching document representation via translation for improved monolingual information retrieval

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Exploring term temporality for pseudo-relevance feedback

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Query expansion based on a semantic graph model

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
LIA at INEX 2010 book track

INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
Improving query expansion for image retrieval via saliency and picturability

CLEF'11 Proceedings of the Second international conference on Multilingual and multimodal information access evaluation
External query reformulation for text-based image retrieval

SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Interactive sense feedback for difficult queries

Proceedings of the 20th ACM international conference on Information and knowledge management
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Effective query formulation with multiple information sources

Proceedings of the fifth ACM international conference on Web search and data mining
Promoting ranking diversity for biomedical information retrieval using wikipedia

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
QAque: faceted query expansion techniques for exploratory search using community QA resources

Proceedings of the 21st international conference companion on World Wide Web
Query phrase expansion using wikipedia in patent class search

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
A web 2.0 approach for organizing search results using wikipedia

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Wikipedia-based smoothing for enhancing text clustering

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Reformulation of Telugu web query using word semantic relationships

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Exploiting External Collections for Query Expansion

ACM Transactions on the Web (TWEB)
Entity based Q&A retrieval

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Generating event storylines from microblogs

Proceedings of the 21st ACM international conference on Information and knowledge management
Automatic query expansion based on tag recommendation

Proceedings of the 21st ACM international conference on Information and knowledge management
Selecting expansion terms as a set via integer linear programming

Proceedings of the 21st ACM international conference on Information and knowledge management
Robust query rewriting using anchor data

Proceedings of the sixth ACM international conference on Web search and data mining
Query expansion powered by wikipedia hyperlinks

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
DIKEA: domain-independent keyphrase extraction algorithm

AI'12 Proceedings of the 25th Australasian joint conference on Advances in Artificial Intelligence
Modeling reformulation using query distributions

ACM Transactions on Information Systems (TOIS)
An incremental approach to efficient pseudo-relevance feedback

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Query expansion using path-constrained random walks

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Multi-step classification approaches to cumulative citation recommendation

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Retrieving opinions from discussion forums

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Social semantic query expansion

ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Constructing query-specific knowledge bases

Proceedings of the 2013 workshop on Automated knowledge base construction
Leveraging related entities for knowledge base acceleration

Proceedings of the 4th international workshop on Web-scale knowledge representation retrieval and reasoning
Wikipedia-based semantic query enrichment

Proceedings of the sixth international workshop on Exploiting semantic annotations in information retrieval
Collaborative pseudo-relevance feedback

Expert Systems with Applications: An International Journal
Improving short text classification using public search engines

IUKM'13 Proceedings of the 2013 international conference on Integrated Uncertainty in Knowledge Modelling and Decision Making
Hybrid pseudo-relevance feedback for microblog retrieval

Journal of Information Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

Pseudo-relevance feedback (PRF) via query-expansion has been proven to be e®ective in many information retrieval (IR) tasks. In most existing work, the top-ranked documents from an initial search are assumed to be relevant and used for PRF. One problem with this approach is that one or more of the top retrieved documents may be non-relevant, which can introduce noise into the feedback process. Besides, existing methods generally do not take into account the significantly different types of queries that are often entered into an IR system. Intuitively, Wikipedia can be seen as a large, manually edited document collection which could be exploited to improve document retrieval effectiveness within PRF. It is not obvious how we might best utilize information from Wikipedia in PRF, and to date, the potential of Wikipedia for this task has been largely unexplored. In our work, we present a systematic exploration of the utilization of Wikipedia in PRF for query dependent expansion. Specifically, we classify TREC topics into three categories based on Wikipedia: 1) entity queries, 2) ambiguous queries, and 3) broader queries. We propose and study the effectiveness of three methods for expansion term selection, each modeling the Wikipedia based pseudo-relevance information from a different perspective. We incorporate the expansion terms into the original query and use language modeling IR to evaluate these methods. Experiments on four TREC test collections, including the large web collection GOV2, show that retrieval performance of each type of query can be improved. In addition, we demonstrate that the proposed method out-performs the baseline relevance model in terms of precision and robustness.