Random selection assisted long web search query optimization

Authors:
Teng-Sheng Moh;Jehaan Irani
Affiliations:
San Jose State University, San Jose, CA;San Jose State University, San Jose, CA
Venue:
Proceedings of the 50th Annual Southeast Regional Conference
Year:
2012

Citing 8
Cited 0

Predicting query performance

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Generating query substitutions

Proceedings of the 15th international conference on World Wide Web
Unsupervised query segmentation using generative language models and wikipedia

Proceedings of the 17th international conference on World Wide Web
A unified and discriminative model for query refinement

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Improved query difficulty prediction for the web

Proceedings of the 17th ACM conference on Information and knowledge management
Reducing long queries using query quality predictors

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
A Query Substitution-Search Result Refinement Approach for Long Query Web Searches

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Exploring reductions for long web queries

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Commercial search engines do not return optimal search results when the query is a long or multi-topic one [1]. Long queries are used extensively. While the creator of the long query would most likely use natural language to describe the query, it usually contains extra information. This information dilutes the results of a web search, and hence decreases the performance as well as the quality of the results returned. Kumaran et al. [14] showed that shorter queries extracted from longer user generated queries are more effective for ad-hoc retrieval. Hence, reducing the query length by removing extra terms improves the quality of the search results. There are numerous approaches used to address this shortfall. Our approach evaluates various versions of the query, trying to find the optimal one. This variation is achieved by reducing the query length using a random keyword combination. We use existing models and plug in information with the help of randomization to improve the overall performance while keeping any overhead calculations in check.