This paper examines the factors affecting the performance of global query expansion based on term co-occurrence data and suggests a way to maximize retrieval effectiveness. The major parameters to be optimized through experiments are the term similarity measure and the weighting scheme for additional terms. The evaluation of four similarity measures tested in query expansion reveals that mutual information and Yule's Y, which emphasize low-frequency terms, achieve better performance than the cosine and Jaccard coefficients, which have the reverse tendency. In the evaluation of three weighting schemes, the similarity weight performs well only with short queries, whereas fixed weights of approximately 0.5 and similarity rank weights are effective with queries of any length. Furthermore, the optimal similarity rank weight, which achieves the best overall performance, seems to be the least affected by the test collection and the number of additional terms. For retrieval efficiency, the number of additional terms need not exceed 70 in our test collections, but the optimal number may vary according to the characteristics of the similarity measure employed.
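The frequency behaviour of the four measures can be illustrated with a minimal sketch. The abstract does not give formulas, so the code below assumes the common textbook definitions over a 2x2 document co-occurrence table (pointwise mutual information stands in for "mutual information"); the paper's exact variants may differ. All function names are hypothetical, and the "rank" weighting in particular is an assumed linear decay, not the scheme defined in the paper.

```python
import math

def contingency(df_x, df_y, df_xy, n_docs):
    """Build the 2x2 table (a, b, c, d) from document frequencies.
    a: docs with both terms, b: x only, c: y only, d: neither."""
    a = df_xy
    b = df_x - df_xy
    c = df_y - df_xy
    d = n_docs - a - b - c
    return a, b, c, d

def jaccard(a, b, c, d):
    return a / (a + b + c) if (a + b + c) else 0.0

def cosine(a, b, c, d):
    denom = math.sqrt((a + b) * (a + c))
    return a / denom if denom else 0.0

def pmi(a, b, c, d):
    # ASSUMPTION: pointwise mutual information, log base 2.
    n = a + b + c + d
    if a == 0:
        return float("-inf")
    return math.log2(a * n / ((a + b) * (a + c)))

def yules_y(a, b, c, d):
    ad, bc = math.sqrt(a * d), math.sqrt(b * c)
    return (ad - bc) / (ad + bc) if (ad + bc) else 0.0

def weight_expansion_terms(ranked, scheme="fixed", fixed=0.5):
    """Assign weights to expansion terms.
    ranked: list of (term, similarity), sorted by descending similarity."""
    k = len(ranked)
    weights = {}
    for rank, (term, sim) in enumerate(ranked, start=1):
        if scheme == "fixed":
            weights[term] = fixed          # e.g. about 0.5, as in the abstract
        elif scheme == "similarity":
            weights[term] = sim            # use the similarity value directly
        elif scheme == "rank":
            # ASSUMPTION: hypothetical linear decay from 1.0 down to 1/k.
            weights[term] = (k - rank + 1) / k
        else:
            raise ValueError(f"unknown scheme: {scheme}")
    return weights

# Two term pairs with the same overlap ratio but different frequencies,
# in a made-up collection of 1000 documents.
rare = contingency(df_x=10, df_y=10, df_xy=5, n_docs=1000)
frequent = contingency(df_x=400, df_y=400, df_xy=200, n_docs=1000)

for name, measure in [("jaccard", jaccard), ("cosine", cosine),
                      ("pmi", pmi), ("yules_y", yules_y)]:
    print(f"{name:8s} rare={measure(*rare):.3f} frequent={measure(*frequent):.3f}")
```

In this toy setting, Jaccard and cosine score the rare and the frequent pair identically, while pointwise mutual information and Yule's Y score the rare pair higher, which is consistent with the abstract's remark that the latter two emphasize low-frequency terms. The counts are invented for illustration and are not taken from the paper's test collections.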