Discovering key concepts in verbose queries

Authors:
Michael Bendersky; W. Bruce Croft
Affiliations:
University of Massachusetts, Amherst, MA, USA; University of Massachusetts, Amherst, MA, USA
Venue:
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2008

Abstract

Current search engines do not, in general, perform well with longer, more verbose queries. One of the main issues in processing these queries is identifying the key concepts that will have the most impact on effectiveness. In this paper, we develop and evaluate a technique that uses query-dependent, corpus-dependent, and corpus-independent features for automatic extraction of key concepts from verbose queries. We show that our method achieves higher accuracy in the identification of key concepts than standard weighting methods such as inverse document frequency. Finally, we propose a probabilistic model for integrating the weighted key concepts identified by our method into a query, and demonstrate that this integration significantly improves retrieval effectiveness for a large set of natural language description queries derived from TREC topics on several newswire and web collections.
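
The abstract does not give the feature set or the exact form of the probabilistic model, so the following is only a minimal sketch of the general idea under stated assumptions. It uses inverse document frequency (the baseline weighting the abstract names) as a stand-in for the learned concept weights, and it assumes a simple linear interpolation between a base query score and a weighted sum of per-concept match scores. The function names, the toy corpus, the `concept_match` scorer, and the `lam` parameter are all hypothetical and are not taken from the paper.

```python
import math


def concept_idf(concept, docs):
    """Smoothed IDF of a concept; a document counts if it contains every word."""
    words = concept.split()
    df = sum(1 for doc in docs if all(w in doc for w in words))
    # Add-one smoothing keeps the value finite when df is 0.
    return math.log((len(docs) + 1) / (df + 1))


def normalized_weights(concepts, docs):
    """Turn raw IDF scores into weights that sum to 1 (assumed normalization)."""
    raw = {c: concept_idf(c, docs) for c in concepts}
    total = sum(raw.values())
    if total == 0:
        return {c: 1.0 / len(concepts) for c in concepts}
    return {c: v / total for c, v in raw.items()}


def concept_match(concept, doc):
    """Hypothetical per-concept score: fraction of doc tokens in the concept."""
    words = set(concept.split())
    if not doc:
        return 0.0
    return sum(1 for tok in doc if tok in words) / len(doc)


def interpolated_score(base_score, weights, doc, lam=0.8):
    """Assumed form: lam * base score + (1 - lam) * weighted concept evidence."""
    concept_part = sum(w * concept_match(c, doc) for c, w in weights.items())
    return lam * base_score + (1 - lam) * concept_part


# Toy corpus of tokenized documents (illustrative only).
docs = [
    "the government passed a logging bill".split(),
    "the government bill protects the spotted owl".split(),
    "the logging bill was delayed by the government".split(),
    "owl sightings were reported near the river".split(),
]
concepts = ["government", "spotted owl", "bill"]

weights = normalized_weights(concepts, docs)
top_concept = max(weights, key=weights.get)

# With equal base scores, the document matching the heavier concept ranks higher.
score_doc0 = interpolated_score(0.5, weights, docs[0])
score_doc1 = interpolated_score(0.5, weights, docs[1])
```

In this toy setup the rarest concept receives the largest weight, so a document that mentions it is promoted over one with the same base score. The method described in the abstract replaces the IDF stand-in with weights derived from query-dependent, corpus-dependent, and corpus-independent features.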