Improving pseudo-relevance feedback via tweet selection

Authors:
Taiki Miyanishi;Kazuhiro Seki;Kuniaki Uehara
Affiliations:
Kobe University, Kobe City, Japan;Kobe University, Kobe City, Japan;Kobe University, Kobe City, Japan
Venue:
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Year:
2013

Citing 34
Cited 0

Towards interactive query expansion

SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Improving automatic query expansion

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Model-based feedback in the language modeling approach to information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Cumulated gain-based evaluation of IR techniques

ACM Transactions on Information Systems (TOIS)
Time-based language models

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
A study of smoothing methods for language models applied to information retrieval

ACM Transactions on Information Systems (TOIS)
Cluster-based retrieval using language models

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Corpus structure, language models, and ad hoc information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Better than the real thing?: iterative pseudo-query processing using cluster-based language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
LDA-based document models for ad-hoc retrieval

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Language model information retrieval with document expansion

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Temporal profiles of queries

ACM Transactions on Information Systems (TOIS)
Term feedback for information retrieval with language models

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Latent concept expansion using markov random fields

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A comparison of statistical significance tests for information retrieval evaluation

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Selecting good expansion terms for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Discovering key concepts in verbose queries

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Integration of news content into web results

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Cluster-based query expansion

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Positional relevance model for pseudo-relevance feedback

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Incorporating query expansion and quality indicators in searching microblog posts

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Estimation methods for ranking recent information

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Time-based relevance models

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Answering General Time-Sensitive Queries

IEEE Transactions on Knowledge and Data Engineering
Exploiting real-time information retrieval in the microblogosphere

Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Adaptive temporal query modeling

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Improving retrieval of short texts through document expansion

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Structured event retrieval over microblog archives

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Survival analysis for freshness in microblogging search

Proceedings of the 21st ACM international conference on Information and knowledge management
Temporal models for microblogs

Proceedings of the 21st ACM international conference on Information and knowledge management
Cognitive temporal document priors

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Combining recency and topic-dependent temporal variation for microblog search

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Query expansion methods using pseudo-relevance feedback have been shown effective for microblog search because they can solve vocabulary mismatch problems often seen in searching short documents such as Twitter messages (tweets), which are limited to 140 characters. Pseudo-relevance feedback assumes that the top ranked documents in the initial search results are relevant and that they contain topic-related words appropriate for relevance feedback. However, those assumptions do not always hold in reality because the initial search results often contain many irrelevant documents. In such a case, only a few of the suggested expansion words may be useful with many others being useless or even harmful. To overcome the limitation of pseudo-relevance feedback for microblog search, we propose a novel query expansion method based on two-stage relevance feedback that models search interests by manual tweet selection and integration of lexical and temporal evidence into its relevance model. Our experiments using a corpus of microblog data (the Tweets2011 corpus) demonstrate that the proposed two-stage relevance feedback approaches considerably improve search result relevance over almost all topics.