An efficient approach to suggesting topically related web queries using hidden topic model

Authors:
Lin Li;Guandong Xu;Zhenglu Yang;Peter Dolog;Yanchun Zhang;Masaru Kitsuregawa
Affiliations:
School of Computer Science and Technolgoy, Wuhan University of Technology, Wuhan, China;Department of Computer Science, Aalborg University, Aalborg, Denmark;Institute of Industrial Science, The University of Tokyo, Tokyo, Japan;Department of Computer Science, Aalborg University, Aalborg, Denmark;School of Engineering & Science, Victoria University, Melbourne, Australia;Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
Venue:
World Wide Web
Year:
2013

Citing 41
Cited 2

Query expansion using lexical-semantic relations

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic feedback using past queries: social searching?

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Real life information retrieval: a study of user queries on the Web

ACM SIGIR Forum
Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
Agglomerative clustering of a search engine query log

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Community search assistant

Proceedings of the 6th international conference on Intelligent user interfaces
Query clustering using user logs

ACM Transactions on Information Systems (TOIS)
SimRank: a measure of structural-context similarity

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Query Expansion by Mining User Logs

IEEE Transactions on Knowledge and Data Engineering
Latent dirichlet allocation

The Journal of Machine Learning Research
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Semantic similarity between search engine queries using temporal correlation

WWW '05 Proceedings of the 14th international conference on World Wide Web
Concept-based interactive query expansion

Proceedings of the 14th ACM international conference on Information and knowledge management
Query expansion using random walk models

Proceedings of the 14th ACM international conference on Information and knowledge management
Neighborhood Formation and Anomaly Detection in Bipartite Graphs

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Mining dependency relations for query expansion in passage retrieval

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Scaling up all pairs similarity search

Proceedings of the 16th international conference on World Wide Web
Query topic detection for reformulation

Proceedings of the 16th international conference on World Wide Web
Personalized query expansion for the web

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Popularity and findability through log analysis of search terms and queries: the case of a multilingual public service website

Journal of Information Science
Improving search engines by query clustering

Journal of the American Society for Information Science and Technology
Mining related queries from Web search engine query logs using an improved association rule mining model

Journal of the American Society for Information Science and Technology
Introduction to Information Retrieval

Introduction to Information Retrieval
Context-aware query suggestion by mining click-through and session data

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Query Recommendation Using Large-Scale Web Access Logs and Web Page Archive

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Query suggestion using hitting time

Proceedings of the 17th ACM conference on Information and knowledge management
Learning latent semantic relations from clickthrough data for query suggestion

Proceedings of the 17th ACM conference on Information and knowledge management
Search-based query suggestion

Proceedings of the 17th ACM conference on Information and knowledge management
Hierarchical location and topic based query expansion

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Query-URL bipartite based approach to personalized query recommendation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
The Effectiveness of Latent Semantic Analysis for Building Up a Bottom-up Taxonomy from Folksonomy Tags

World Wide Web
Relaxing RDF queries based on user and domain preferences

Journal of Intelligent Information Systems
Finding Related Search Engine Queries by Web Community Based Query Enrichment

World Wide Web
Effects of popularity and quality on the usage of query suggestions during information search

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Optimal rare query suggestion with implicit user feedback

Proceedings of the 19th international conference on World wide web
Suggesting Topic-Based Query Terms as You Type

APWEB '10 Proceedings of the 2010 12th International Asia-Pacific Web Conference
Towards query log based personalization using topic models

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Probabilistic latent semantic analysis

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Query expansion using web access log files

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
An analysis of query similarity in collaborative web search

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Divergence measures based on the Shannon entropy

IEEE Transactions on Information Theory

Probabilistic Web Data Management

World Wide Web
Personalized Query Expansion for Web Search Using Social Keywords

Proceedings of International Conference on Information Integration and Web-based Applications & Services

Quantified Score

Hi-index	0.00

Visualization

Abstract

Keyword-based Web search is a widely used approach for locating information on the Web. However, Web users usually suffer from the difficulties of organizing and formulating appropriate input queries due to the lack of sufficient domain knowledge, which greatly affects the search performance. An effective tool to meet the information needs of a search engine user is to suggest Web queries that are topically related to their initial inquiry. Accurately computing query-to-query similarity scores is a key to improve the quality of these suggestions. Because of the short lengths of queries, traditional pseudo-relevance or implicit-relevance based approaches expand the expression of the queries for the similarity computation. They explicitly use a search engine as a complementary source and directly extract additional features (such as terms or URLs) from the top-listed or clicked search results. In this paper, we propose a novel approach by utilizing the hidden topic as an expandable feature. This has two steps. In the offline model-learning step, a hidden topic model is trained, and for each candidate query, its posterior distribution over the hidden topic space is determined to re-express the query instead of the lexical expression. In the online query suggestion step, after inferring the topic distribution for an input query in a similar way, we then calculate the similarity between candidate queries and the input query in terms of their corresponding topic distributions; and produce a suggestion list of candidate queries based on the similarity scores. Our experimental results on two real data sets show that the hidden topic based suggestion is much more efficient than the traditional term or URL based approach, and is effective in finding topically related queries for suggestion.