Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A probabilistic model of information retrieval: development and comparative experiments Part 2
Information Processing and Management: an International Journal
Hybrid Pre-Query Term Expansion using Latent Semantic Analysis
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
A study of factors affecting the utility of implicit relevance feedback
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Active feedback in ad hoc information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
The Effect of Weighted Term Frequencies on Probabilistic Latent Semantic Term Relationships
SPIRE '08 Proceedings of the 15th International Symposium on String Processing and Information Retrieval
The VLDB Journal — The International Journal on Very Large Data Bases
The Sensitivity of Latent Dirichlet Allocation for Information Retrieval
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Kernel latent semantic analysis using an information retrieval based kernel
Proceedings of the 18th ACM conference on Information and knowledge management
Query expansion for the language modelling framework using the naïve Bayes assumption
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
A Survey of Automatic Query Expansion in Information Retrieval
ACM Computing Surveys (CSUR)
A novel neighborhood based document smoothing model for information retrieval
Information Retrieval
Collaborative pseudo-relevance feedback
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Many queries on collections of text documents are too short to produce informative results. Automatic query expansion is a method of adding terms to the query without interaction from the user in order to obtain more refined results. In this investigation, we examine our novel automatic query expansion method using the probabilistic latent semantic thesaurus, which is based on probabilistic latent semantic analysis. We show how to construct the thesaurus by mining text documents for probabilistic term relationships, and we show that by using the latent semantic thesaurus, we can overcome many of the problems associated to latent semantic analysis on large document sets which were previously identified. Experiments using TREC document sets show that our term expansion method out performs the popular probabilistic pseudorelevance feedback method by 7.3%.