Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Inferring probability of relevance using the method of logistic regression
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Improving two-stage ad-hoc retrieval for short queries
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Cumulated gain-based evaluation of IR techniques
ACM Transactions on Information Systems (TOIS)
Optimizing search engines using clickthrough data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Retrieval evaluation with incomplete information
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A Markov random field model for term dependencies
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Linear feature-based models for information retrieval
Information Retrieval
A support vector method for optimizing average precision
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
An exploration of proximity measures in information retrieval
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Latent concept expansion using markov random fields
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Incorporating term dependency in the dfr framework
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Unsupervised query segmentation using generative language models and wikipedia
Proceedings of the 17th international conference on World Wide Web
Investigation of partial query proximity in web search
Proceedings of the 17th international conference on World Wide Web
Selecting good expansion terms for pseudo-relevance feedback
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A unified and discriminative model for query refinement
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Discovering key concepts in verbose queries
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Learning in a pairwise term-term proximity framework for information retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
An improved markov random field model for supporting verbose queries
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Reducing long queries using query quality predictors
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Two-stage query segmentation for information retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Learning to Rank for Information Retrieval
Foundations and Trends in Information Retrieval
Boosting web retrieval through query operations
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Exploring reductions for long web queries
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Ranking under temporal constraints
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Improved latent concept expansion using hierarchical markov random fields
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Improving verbose queries using subset distribution
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Using various term dependencies according to their utilities
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Quality-biased ranking of web documents
Proceedings of the fourth ACM international conference on Web search and data mining
Key concepts identification and weighting in search engine queries
APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
A cascade ranking model for efficient ranked retrieval
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Parameterized concept weighting in verbose queries
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Modeling subset distributions for verbose queries
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Learning to rank under tight budget constraints
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Query term ranking based on search results overlap
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
A quasi-synchronous dependence model for information retrieval
Proceedings of the 20th ACM international conference on Information and knowledge management
Effective query formulation with multiple information sources
Proceedings of the fifth ACM international conference on Web search and data mining
A field relevance model for structured document retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
A log-logistic model-based interpretation of TF normalization of BM25
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
LePrEF: Learn to precompute evidence fusion for efficient query evaluation
Journal of the American Society for Information Science and Technology
Generating reformulation trees for complex queries
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Extending BM25 with multiple query operators
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Modeling higher-order term dependencies in information retrieval using query hypergraphs
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Generating queries from user-selected text
Proceedings of the 4th Information Interaction in Context Symposium
Discovering relevant features for effective query formulation
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Learning lexicon models from search logs for query expansion
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Harvesting visual concepts for image search with complex queries
Proceedings of the 20th ACM international conference on Multimedia
Modeling reformulation using query distributions
ACM Transactions on Information Systems (TOIS)
Two-Stage learning to rank for information retrieval
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Compact query term selection using topically related text
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Mining pure high-order word associations via information geometry for information retrieval
ACM Transactions on Information Systems (TOIS)
Map search via a factor graph model
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Constructing query-specific knowledge bases
Proceedings of the 2013 workshop on Automated knowledge base construction
BNCOD'13 Proceedings of the 29th British National conference on Big Data
Towards Concept-Based Translation Models Using Search Logs for Query Expansion
Proceedings of the 21st ACM international conference on Information and knowledge management
Diversified top-k graph pattern matching
Proceedings of the VLDB Endowment
Improving search relevance for short queries in community question answering
Proceedings of the 7th ACM international conference on Web search and data mining
Indexing Word Sequences for Ranked Retrieval
ACM Transactions on Information Systems (TOIS)
Document vector representations for feature extraction in multi-stage document ranking
Information Retrieval
Semantic concept-enriched dependence model for medical information retrieval
Journal of Biomedical Informatics
Hi-index | 0.00 |
Modeling query concepts through term dependencies has been shown to have a significant positive effect on retrieval performance, especially for tasks such as web search, where relevance at high ranks is particularly critical. Most previous work, however, treats all concepts as equally important, an assumption that often does not hold, especially for longer, more complex queries. In this paper, we show that one of the most effective existing term dependence models can be naturally extended by assigning weights to concepts. We demonstrate that the weighted dependence model can be trained using existing learning-to-rank techniques, even with a relatively small number of training queries. Our study compares the effectiveness of both endogenous (collection-based) and exogenous (based on external sources) features for determining concept importance. To test the weighted dependence model, we perform experiments on both publicly available TREC corpora and a proprietary web corpus. Our experimental results indicate that our model consistently and significantly outperforms both the standard bag-of-words model and the unweighted term dependence model, and that combining endogenous and exogenous features generally results in the best retrieval effectiveness.