User generated content is characterized by short, noisy documents with many spelling errors and unexpected language usage. To bridge the vocabulary gap between the user's information need and the documents in one specific user generated content environment, the blogosphere, we apply a form of query expansion, i.e., adding and reweighting query terms. Because the blogosphere is noisy, query expansion on the collection itself is rarely effective; external, edited collections are more suitable. We propose a generative model for expanding queries using external collections, in which the dependencies between queries, documents, and expansion documents are modeled explicitly. We discuss different instantiations of the model, each making different (in)dependence assumptions. Results using two external collections (news and Wikipedia) show that external expansion is effective for retrieval of user generated content. In addition, conditioning the external collection on the query is very beneficial, and making candidate expansion terms dependent on just the document seems sufficient.
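To make the idea of expanding a query from external collections more concrete, the following is a minimal sketch of one possible instantiation, not the model as specified in the paper. It assumes a relevance-model-style estimate in which each external collection is weighted by how well its documents match the query (a stand-in for conditioning the collection on the query), and in which candidate expansion terms depend only on the document. The function name `expand_query`, the parameters `k`, `lam`, and `eps`, the additive smoothing, and the toy collections are all hypothetical choices made for illustration.

```python
from collections import Counter


def expand_query(query, collections, k=5, lam=0.5, eps=0.01):
    """Return a reweighted query model built from external collections.

    query:       list of query terms
    collections: dict mapping a collection name to a list of non-empty
                 tokenized documents (each a list of terms)
    k:           number of expansion terms to keep (assumed parameter)
    lam:         weight of the original query in the final mixture
    eps:         additive smoothing constant for the query likelihood
    """
    vocab = {t for docs in collections.values() for d in docs for t in d}
    vocab |= set(query)
    vocab_size = len(vocab)

    coll_weight = {}  # unnormalized estimate of P(collection | query)
    coll_terms = {}   # per-collection estimate of P(term | query, collection)

    for name, docs in collections.items():
        # Query likelihood P(Q | D) with additive smoothing.
        likelihoods = []
        for doc in docs:
            counts = Counter(doc)
            denom = len(doc) + eps * vocab_size
            ql = 1.0
            for q in query:
                ql *= (counts[q] + eps) / denom
            likelihoods.append(ql)

        total = sum(likelihoods)
        # Assumption: a collection matters more if its documents match the query.
        coll_weight[name] = total / len(docs)

        # Terms depend only on the document: sum_D P(t | D) * P(D | Q, c).
        term_dist = Counter()
        for doc, ql in zip(docs, likelihoods):
            p_doc = ql / total
            for term, count in Counter(doc).items():
                term_dist[term] += p_doc * count / len(doc)
        coll_terms[name] = term_dist

    # Mix the per-collection term distributions using P(c | Q).
    z = sum(coll_weight.values())
    expansion = Counter()
    for name, term_dist in coll_terms.items():
        for term, p in term_dist.items():
            expansion[term] += (coll_weight[name] / z) * p

    # Keep the top-k expansion terms and renormalize them.
    top = dict(expansion.most_common(k))
    norm = sum(top.values())

    # Interpolate with the original query (adding and reweighting terms).
    model = Counter()
    for term, count in Counter(query).items():
        model[term] += lam * count / len(query)
    for term, p in top.items():
        model[term] += (1 - lam) * p / norm
    return dict(model)


# Toy external collections standing in for news and Wikipedia.
toy_collections = {
    "news": [["obama", "election", "president", "vote"],
             ["football", "match", "goal"]],
    "wiki": [["election", "vote", "ballot", "poll"],
             ["river", "water"]],
}
expanded = expand_query(["election"], toy_collections, k=3)
```

In this sketch the expanded query is a probability distribution over terms, so it could be passed to any retrieval function that accepts weighted query terms. Setting every collection weight to the same value would correspond to ignoring the query when choosing among external collections.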