Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion

Authors:
Zhen Liao;Daxin Jiang;Enhong Chen;Jian Pei;Huanhuan Cao;Hang Li
Affiliations:
Nankai University;Microsoft Research Asia;University of Science and Technology of China;Simon Fraser University;University of Science and Technology of China;Microsoft Research Asia
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST)
Year:
2011

Citing 41
Cited 4

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Journal of Computational and Applied Mathematics
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
The potential and actual effectiveness of interactive query expansion

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Real life information retrieval: a study of user queries on the Web

ACM SIGIR Forum
Patterns of search: analyzing and modeling Web query refinement

UM '99 Proceedings of the seventh international conference on User modeling
Analysis of a very large web search engine query log

ACM SIGIR Forum
Agglomerative clustering of a search engine query log

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering user queries of a search engine

Proceedings of the 10th international conference on World Wide Web
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic query expansion using query logs

Proceedings of the 11th international conference on World Wide Web
Mining Sequential Patterns: Generalizations and Performance Improvements

EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth

Proceedings of the 17th International Conference on Data Engineering
Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Relevant term suggestion in interactive web search based on contextual information in query session logs

Journal of the American Society for Information Science and Technology
Mining anchor text for query refinement

Proceedings of the 13th international conference on World Wide Web
An effective approach to document retrieval via utilizing WordNet and recognizing phrases

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Scoring missing terms in information retrieval tasks

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Introduction to Data Mining, (First Edition)

Introduction to Data Mining, (First Edition)
Concept-based interactive query expansion

Proceedings of the 14th ACM international conference on Information and knowledge management
A web-based kernel function for measuring the similarity of short text snippets

Proceedings of the 15th international conference on World Wide Web
Generating query substitutions

Proceedings of the 15th international conference on World Wide Web
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Personalized query expansion for the web

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Studying the use of popular destinations to enhance web search interaction

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Random walks on the click graph

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Latent concept expansion using markov random fields

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Extracting semantic relations from query logs

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Selecting good expansion terms for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A unified and discriminative model for query refinement

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Context-aware query suggestion by mining click-through and session data

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Query suggestion using hitting time

Proceedings of the 17th ACM conference on Information and knowledge management
The query-flow graph: model and applications

Proceedings of the 17th ACM conference on Information and knowledge management
Query suggestions using query-flow graphs

Proceedings of the 2009 workshop on Web Search Click Data
Smoothing clickthrough data for web search ranking

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Predicting user interests from contextual information

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
An analysis framework for search sequences

Proceedings of the 18th ACM conference on Information and knowledge management
An optimization framework for query recommendation

Proceedings of the third ACM international conference on Web search and data mining
Clustering query refinements by user intent

Proceedings of the 19th international conference on World wide web
Predicting short-term interests using activity-based search context

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Query recommendation using query logs in search engines

EDBT'04 Proceedings of the 2004 international conference on Current Trends in Database Technology
Query phrase suggestion from topically tagged session logs

FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems

Evaluating the effectiveness of search task trails

Proceedings of the 21st international conference on World Wide Web
Learning to personalize query auto-completion

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A vlHMM approach to context-aware search

ACM Transactions on the Web (TWEB)
Mining search and browse logs for web search: A Survey

ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers

Quantified Score

Hi-index	0.00

Visualization

Abstract

Query suggestion plays an important role in improving usability of search engines. Although some recently proposed methods provide query suggestions by mining query patterns from search logs, none of them models the immediately preceding queries as context systematically, and uses context information effectively in query suggestions. Context-aware query suggestion is challenging in both modeling context and scaling up query suggestion using context. In this article, we propose a novel context-aware query suggestion approach. To tackle the challenges, our approach consists of two stages. In the first, offline model-learning stage, to address data sparseness, queries are summarized into concepts by clustering a click-through bipartite. A concept sequence suffix tree is then constructed from session data as a context-aware query suggestion model. In the second, online query suggestion stage, a user’s search context is captured by mapping the query sequence submitted by the user to a sequence of concepts. By looking up the context in the concept sequence suffix tree, we suggest to the user context-aware queries. We test our approach on large-scale search logs of a commercial search engine containing 4.0 billion Web queries, 5.9 billion clicks, and 1.87 billion search sessions. The experimental results clearly show that our approach outperforms three baseline methods in both coverage and quality of suggestions.