Finding Related Search Engine Queries by Web Community Based Query Enrichment

Authors:
Lin Li;Shingo Otsuka;Masaru Kitsuregawa
Affiliations:
Department of Information and Communication Engineering, The University of Tokyo, Tokyo, Japan and School of Computer Science and Technology, Wuhan University of Technology, Wuhan, China;National Institute for Materials Science, Tsukuba, Japan;Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
Venue:
World Wide Web
Year:
2010

Citing 31
Cited 3

Query expansion using lexical-semantic relations

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Characterizing browsing strategies in the World-Wide Web

Proceedings of the Third International World-Wide Web conference on Technology, tools and applications
On the reuse of past optimal queries

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Query expansion using local and global document analysis

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic feedback using past queries: social searching?

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Inferring Web communities from link topology

Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems
Real life information retrieval: a study of user queries on the Web

ACM SIGIR Forum
Finding related pages in the World Wide Web

WWW '99 Proceedings of the eighth international conference on World Wide Web
Trawling the Web for emerging cyber-communities

WWW '99 Proceedings of the eighth international conference on World Wide Web
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
Efficient identification of Web communities

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Agglomerative clustering of a search engine query log

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Community search assistant

Proceedings of the 6th international conference on Intelligent user interfaces
Query clustering using user logs

ACM Transactions on Information Systems (TOIS)
Creating a Web community chart for navigating related communities

Proceedings of the 12th ACM conference on Hypertext and Hypermedia
Self-Organization and Identification of Web Communities

Computer
Query Expansion by Mining User Logs

IEEE Transactions on Knowledge and Data Engineering
Web Communities: Models and Algorithms

World Wide Web
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Query expansion using random walk models

Proceedings of the 14th ACM international conference on Information and knowledge management
Mining search engine query logs for query recommendation

Proceedings of the 15th international conference on World Wide Web
Mining dependency relations for query expansion in passage retrieval

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Discovering Interesting Relationships among Deep Web Databases: A Source-Biased Approach

World Wide Web
Personalized query expansion for the web

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Improving search engines by query clustering

Journal of the American Society for Information Science and Technology
Mining related queries from Web search engine query logs using an improved association rule mining model

Journal of the American Society for Information Science and Technology
A Novelty-based Clustering Method for On-line Documents

World Wide Web
Introduction to Information Retrieval

Introduction to Information Retrieval
Query-URL bipartite based approach to personalized query recommendation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Query expansion using web access log files

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications

A path-based approach for web page retrieval

World Wide Web
An efficient approach to suggesting topically related web queries using hidden topic model

World Wide Web
QUBiC: An adaptive approach to query-based recommendation

Journal of Intelligent Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The conventional approaches of finding related search engine queries rely on the common terms shared by two queries to measure their relatedness. However, search engine queries are usually short and the term overlap between two queries is very small. Using query terms as a feature space cannot accurately estimate relatedness. Alternative feature spaces are needed to enrich the term based search queries. In this paper, given a search query, first we extract the Web pages accessed by users from Japanese Web access logs which store the users individual and collective behavior. From these accessed Web pages we usually can get two kinds of feature spaces, i.e, content-sensitive (e.g., nouns) and content-ignorant (e.g., URLs), to enrich the expressions of search queries. Then, the relatedness between search queries can be estimated on their enriched expressions. Our experimental results show that the URL feature space produces much lower precision scores than the noun feature space which, however, is not applicable in non-text pages, dynamic pages and so on. It is crucial to improve the quality of the URL (content-ignorant) feature space since it is generally available in all types of Web pages. We propose a novel content-ignorant feature space, called Web community which is created from a Japanese Web page archive by exploiting link analysis. Experimental results show that the proposed Web community feature space generates much better results than the URL feature space.