Quantifying query ambiguity

Authors:
Steve Cronen-Townsend;W. Bruce Croft
Affiliations:
University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA
Venue:
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Year:
2002

Citing 13
Cited 31

Elements of information theory

Elements of information theory
The automatic identification of stop words

Journal of Information Science
Lexical ambiguity and information retrieval

ACM Transactions on Information Systems (TOIS)
Viewing morphology as an inference process

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Probabilistic latent semantic indexing

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A general language model for information retrieval (poster abstract)

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
The impact on retrieval effectiveness of skewed frequency distributions

ACM Transactions on Information Systems (TOIS)
Corpus-based statistical screening for content-bearing terms

Journal of the American Society for Information Science and Technology
Employing the resolution power of search keys

Journal of the American Society for Information Science and Technology
Locating question difficulty through explorations in question space

Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Predicting query performance

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval

Predicting query performance

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Using part-of-speech patterns to reduce query ambiguity

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Topic structure modeling

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
More Efficient Searching in a Knowledge Portal - An Approach Based on the Analysis of Users' Queries

PAKM '02 Proceedings of the 4th International Conference on Practical Aspects of Knowledge Management
Newsjunkie: providing personalized newsfeeds via analysis of information novelty

Proceedings of the 13th international conference on World Wide Web
Evaluating high accuracy retrieval techniques

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A large-scale evaluation and analysis of personalized search strategies

Proceedings of the 16th international conference on World Wide Web
Quantify query ambiguity using ODP metadata

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Blogger, stick to your story: modeling topical noise in blogs with coherence measures

Proceedings of the second workshop on Analytics for noisy unstructured text data
Estimating query performance using class predictions

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Work in progress: effects of multiple words on ambiguity in information retrieval

Proceedings of the 46th Annual Southeast Regional Conference on XX
On the query refinement in the ontology-based searching for information

Information Systems - Special issue: The 15th international conference on advanced information systems engineering (CAiSE 2003)
On the query refinement in the ontology-based searching for information

CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering
Using coherence-based measures to predict query difficulty

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Crew: cross-modal resource searching by exploiting wikipedia

Proceedings of the international conference on Multimedia
Investigating retrieval performance with manually-built topic models

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Personalizing web search using long term browsing history

Proceedings of the fourth ACM international conference on Web search and data mining
Towards a collection-based results diversification

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Implicit association via crowd-sourced coselection

Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Detecting outlier sections in us congressional legislation

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Query suggestions in the absence of query logs

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
UB at CLEF2004: cross language information retrieval using statistical language models

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Explaining query modifications: an alternative interpretation of term addition and removal

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
A new search engine integrating hierarchical browsing and keyword search

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Click patterns: an empirical representation of complex query intents

Proceedings of the 21st ACM international conference on Information and knowledge management
Generating pseudo test collections for learning to rank scientific articles

CLEF'12 Proceedings of the Third international conference on Information Access Evaluation: multilinguality, multimodality, and visual analytics
Pseudo test collections for training and tuning microblog rankers

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A Diagnostic Study of Search Result Diversification Methods

Proceedings of the 2013 Conference on the Theory of Information Retrieval
Improving entity search over linked data by modeling latent semantics

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Detecting verbose queries and improving information retrieval

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop a measure of a query with respect to a collection of documents with the aim of quantifying the query's ambiguity with respect to those documents. This measure, the clarity score, is the relative entropy between a query language model and the corresponding collection language model. We substantiate that the clarity score measures the coherence and specificity of the language used in documents likely to satisfy the query. We also argue that it provides a suitable quantification of the (lack of) ambiguity of a query with respect to a collection of documents and has potential applications throughout the field of information retrieval. In particular, the clarity score is shown to correlate positively with average precision in evaluations using TREC test collections. Hence, as one example, the clarity score could serve as a predictor of query performance. Systems would then be able to identify vague information requests and respond differently than they would to clear and specific requests.