Mining broad latent query aspects from search sessions

Authors:
Xuanhui Wang;Deepayan Chakrabarti;Kunal Punera
Affiliations:
University of Illinois at Urbana-Champaign, Urbana, IL, USA;Yahoo! Research, Sunnyvale, CA, USA;Yahoo! Research, Sunnyvale, CA, USA
Venue:
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2009

Citing 19
Cited 11

Optimization of relevance feedback weights

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Analysis of a very large web search engine query log

ACM SIGIR Forum
Similarity estimation techniques from rounding algorithms

STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Probabilistic query expansion using query logs

Proceedings of the 11th international conference on World Wide Web
Modern Information Retrieval

Modern Information Retrieval
Improving pseudo-relevance feedback in web information retrieval using web page segmentation

WWW '03 Proceedings of the 12th international conference on World Wide Web
Query expansion using associated queries

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Identifying similarities, periodicities and bursts for online search queries

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
A temporal comparison of AltaVista Web searching: Research Articles

Journal of the American Society for Information Science and Technology
Semantic similarity between search engine queries using temporal correlation

WWW '05 Proceedings of the 14th international conference on World Wide Web
Introduction to Data Mining, (First Edition)

Introduction to Data Mining, (First Edition)
Concept-based interactive query expansion

Proceedings of the 14th ACM international conference on Information and knowledge management
Generating query substitutions

Proceedings of the 15th international conference on World Wide Web
Personalized query expansion for the web

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
An experimental comparison of click position-bias models

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Using the wisdom of the crowds for keyword generation

Proceedings of the 17th international conference on World Wide Web
Context-aware query suggestion by mining click-through and session data

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
The query-flow graph: model and applications

Proceedings of the 17th ACM conference on Information and knowledge management
Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs

Proceedings of the 17th ACM conference on Information and knowledge management

Clustering query refinements by user intent

Proceedings of the 19th international conference on World wide web
Building taxonomy of web search intents for name entity queries

Proceedings of the 19th international conference on World wide web
Identifying aspects for web-search queries

Journal of Artificial Intelligence Research
Topic modeling for named entity queries

Proceedings of the 20th ACM international conference on Information and knowledge management
The role of query sessions in extracting instance attributes from web search queries

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Unsupervised extraction of template structure in web search queries

Proceedings of the 21st international conference on World Wide Web
Multi-aspect query summarization by composite query

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
The wisdom of advertisers: mining subgoals via query clustering

Proceedings of the 21st ACM international conference on Information and knowledge management
Role-explicit query identification and intent role annotation

Proceedings of the 21st ACM international conference on Information and knowledge management
Extracting query facets from search results

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Heterogeneous graph-based intent learning with queries, web pages and Wikipedia concepts

Proceedings of the 7th ACM international conference on Web search and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Search queries are typically very short, which means they are often underspecified or have senses that the user did not think of. A broad latent query aspect is a set of keywords that succinctly represents one particular sense, or one particular information need, that can aid users in reformulating such queries. We extract such broad latent aspects from query reformulations found in historical search session logs. We propose a framework under which the problem of extracting such broad latent aspects reduces to that of optimizing a formal objective function under constraints on the total number of aspects the system can store, and the number of aspects that can be shown in response to any given query. We present algorithms to find a good set of aspects, and also to pick the best k aspects matching any query. Empirical results on real-world search engine logs show significant gains over a strong baseline that uses single-keyword reformulations: a gain of 14% and 23% in terms of human-judged accuracy and click-through data respectively, and around 20% in terms of consistency among aspects predicted for "similar" queries. This demonstrates both the importance of broad query aspects, and the efficacy of our algorithms for extracting them.