Collections containing a large number of short documents are becoming increasingly common. As these collections grow in number and size, providing effective retrieval of brief texts presents a significant research problem. We propose a novel approach to improving information retrieval (IR) for short texts based on aggressive document expansion. Starting from the hypothesis that short documents tend to be about a single topic, we submit documents as pseudo-queries and analyze the results to learn about the documents themselves. Document expansion helps in this context because short documents yield little in the way of term frequency information. However, as we show, the proposed technique helps us model not only lexical properties, but also temporal properties of documents. We present experimental results using a corpus of microblog (Twitter) data and a corpus of metadata records from a federated digital library. With respect to established baselines, results of these experiments show that applying our proposed document expansion method yields significant improvements in effectiveness. Specifically, our method improves the lexical representation of documents and the ability to let time influence retrieval.
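
The pseudo-query idea described above can be illustrated with a small sketch. The abstract does not state the retrieval model, the number of results used, or how results are combined with the original document, so everything below is an assumption made for illustration: Dirichlet-smoothed query likelihood as the ranker, pooled term counts from the top `k` results as the expansion model, and linear interpolation with weight `lam`. The function names (`tokenize`, `score`, `expand`) and the toy corpus are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    """Whitespace tokenizer; a stand-in for real preprocessing."""
    return text.lower().split()


def score(query_tokens, doc_tokens, coll_counts, coll_len, mu=10.0):
    """Dirichlet-smoothed query log-likelihood (assumed ranker)."""
    counts = Counter(doc_tokens)
    doc_len = len(doc_tokens)
    total = 0.0
    for w in query_tokens:
        p_coll = coll_counts[w] / coll_len  # > 0: the query comes from the corpus
        total += math.log((counts[w] + mu * p_coll) / (doc_len + mu))
    return total


def expand(doc_id, corpus, k=2, lam=0.5):
    """Expand corpus[doc_id] by submitting it as a pseudo-query.

    Returns (model, ranked): a unigram distribution mixing the original
    document with its top-k retrieved neighbours, and the neighbour ids.
    Assumes the document is non-empty.
    """
    tokens = {i: tokenize(t) for i, t in corpus.items()}
    coll_counts = Counter(w for toks in tokens.values() for w in toks)
    coll_len = sum(coll_counts.values())

    # Step 1: the short document itself is the query.
    query = tokens[doc_id]
    candidates = [i for i in tokens if i != doc_id]
    ranked = sorted(
        candidates,
        key=lambda i: score(query, tokens[i], coll_counts, coll_len),
        reverse=True,
    )[:k]

    # Step 2: maximum-likelihood model of the original document.
    orig_counts = Counter(query)
    p_orig = {w: c / len(query) for w, c in orig_counts.items()}

    # Step 3: expansion model from pooled counts of the retrieved documents.
    pooled = Counter(w for i in ranked for w in tokens[i])
    pooled_len = sum(pooled.values())
    if pooled_len == 0:  # nothing retrieved: fall back to the original model
        return p_orig, ranked
    p_exp = {w: c / pooled_len for w, c in pooled.items()}

    # Step 4: linear interpolation of the two models (assumed combination rule).
    vocab = set(p_orig) | set(p_exp)
    model = {w: lam * p_orig.get(w, 0.0) + (1 - lam) * p_exp.get(w, 0.0)
             for w in vocab}
    return model, ranked


corpus = {
    "a": "earthquake japan",
    "b": "earthquake japan tsunami warning",
    "c": "football match tonight",
    "d": "japan earthquake relief",
}
model, neighbours = expand("a", corpus)
```

In this toy corpus the two-word document "a" gains probability mass for terms such as "tsunami" and "relief" that occur only in its retrieved neighbours, which is the kind of lexical enrichment the abstract attributes to document expansion. If each document also carried a timestamp, the timestamps of the retrieved neighbours could be collected in the same pass to estimate a temporal profile; that step is not shown here.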