Many recent and highly effective retrieval models for long queries use query reformulation methods that jointly optimize term weights and term selection. These methods learn from word context and global context but typically fail to capture query context. In this paper, we present a novel term ranking algorithm, PhRank, that extends work on Markov chain frameworks for query expansion to select compact and focused terms from within the query itself. The resulting queries are focused enough that one to five terms in an unweighted model achieve better retrieval effectiveness than weighted term selection models that use up to 30 terms. PhRank terms are also compact, typically containing one or two words, whereas competing models use query subsets up to seven words long. PhRank captures query context with an affinity graph constructed from word co-occurrence in pseudo-relevant documents. Terms are ranked by combining a random walk over the graph with discrimination weights. Empirical evaluation on newswire and web collections shows that, compared to highly competitive information retrieval (IR) models, reformulated queries perform significantly better for long queries and at least as well for short keyword queries.
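
The general idea of ranking query words with a random walk over a co-occurrence graph can be illustrated with a minimal Python sketch. This is not the PhRank algorithm itself, whose details are not given in the abstract. The window size, the damping factor, the PageRank-style update, the IDF-like discrimination weight, and the function names `build_affinity_graph`, `random_walk_scores`, and `rank_query_terms` are all assumptions made for illustration.

```python
import math


def build_affinity_graph(docs, window=2):
    """Build an undirected graph whose edge weights count how often two
    distinct words co-occur within `window` following tokens.
    `docs` is a list of token lists (assumed pseudo-relevant documents)."""
    graph = {}
    for tokens in docs:
        for i, w in enumerate(tokens):
            for v in tokens[i + 1:i + 1 + window]:
                if v == w:
                    continue
                graph.setdefault(w, {})
                graph.setdefault(v, {})
                graph[w][v] = graph[w].get(v, 0.0) + 1.0
                graph[v][w] = graph[v].get(w, 0.0) + 1.0
    return graph


def random_walk_scores(graph, damping=0.85, iterations=50):
    """Approximate the stationary distribution of a damped random walk
    (a PageRank-style update, assumed here for illustration)."""
    nodes = list(graph)
    if not nodes:
        return {}
    n = len(nodes)
    score = {w: 1.0 / n for w in nodes}
    out_weight = {w: sum(graph[w].values()) for w in nodes}
    for _ in range(iterations):
        new_score = {}
        for w in nodes:
            # Probability mass arriving at w from each neighbour v.
            incoming = sum(
                score[v] * graph[v][w] / out_weight[v] for v in graph[w]
            )
            new_score[w] = (1.0 - damping) / n + damping * incoming
        score = new_score
    return score


def rank_query_terms(query, docs, doc_freq, num_docs):
    """Rank the query words found in the graph by walk score multiplied
    by an IDF-like discrimination weight (hypothetical weighting)."""
    walk = random_walk_scores(build_affinity_graph(docs))
    ranked = []
    for w in set(query):
        if w in walk:
            idf = math.log(num_docs / (1.0 + doc_freq.get(w, 0)))
            ranked.append((w, walk[w] * idf))
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)
```

Calling `rank_query_terms(query_tokens, feedback_docs, doc_freq, num_docs)` returns `(word, score)` pairs in descending order of score. Query words that never appear in the feedback documents are dropped, which is one simple way to keep the ranking tied to the query's own vocabulary.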