Classifying search engine queries using the web as background knowledge

Authors:
David Vogel;Steffen Bickel;Peter Haider;Rolf Schimpfky;Peter Siemen;Steve Bridges;Tobias Scheffer
Affiliations:
A.I. Insight, Inc., Orlando, Florida;Humboldt-Universität zu Berlin, Berlin, Germany;Humboldt-Universität zu Berlin, Berlin, Germany;Humboldt-Universität zu Berlin, Berlin, Germany;Humboldt-Universität zu Berlin, Berlin, Germany;MEDai, Inc., Orlando, Florida;Humboldt-Universität zu Berlin, Berlin, Germany
Venue:
ACM SIGKDD Explorations Newsletter
Year:
2005

Citing 12
Cited 17

Scatter/Gather: a cluster-based approach to browsing large document collections

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
The paraphrase search assistant: terminological feedback for iterative information seeking

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Grouper: a dynamic clustering interface to Web search results

WWW '99 Proceedings of the eighth international conference on World Wide Web
Community search assistant

Proceedings of the 6th international conference on Intelligent user interfaces
On integrating catalogs

Proceedings of the 10th international conference on World Wide Web
Web searching: a process-oriented experimental study of three interactive search paradigms

Journal of the American Society for Information Science and Technology
Using part-of-speech patterns to reduce query ambiguity

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieving with Good Sense

Information Retrieval
On Combining Link and Contents Information for Web Page Clustering

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Word sense disambiguation in information retrieval revisited

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Learning to integrate web taxonomies

Web Semantics: Science, Services and Agents on the World Wide Web
Carrot2 and language properties in web search results clustering

AWIC'03 Proceedings of the 1st international Atlantic web intelligence conference on Advances in web intelligence

Building bridges for web query classification

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Query enrichment for web-query classification

ACM Transactions on Information Systems (TOIS)
Automatic classification of Web queries using very large unlabeled query logs

ACM Transactions on Information Systems (TOIS)
Robust classification of rare queries using web knowledge

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Search advertising using web relevance feedback

Proceedings of the 17th ACM conference on Information and knowledge management
Classifying search queries using the Web as a source of knowledge

ACM Transactions on the Web (TWEB)
Unsupervised query categorization using automatically-built concept graphs

Proceedings of the 18th international conference on World wide web
Understanding user's query intent with wikipedia

Proceedings of the 18th international conference on World wide web
Phrase clustering for discriminative learning

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Classification-enhanced ranking

Proceedings of the 19th international conference on World wide web
Applying taxonomic knowledge to Bayesian belief network for personalized search

Proceedings of the 2010 ACM Symposium on Applied Computing
Mining Query Logs: Turning Search Usage Data into Knowledge

Foundations and Trends in Information Retrieval
Mining Historic Query Trails to Label Long and Rare Search Engine Queries

ACM Transactions on the Web (TWEB)
Which should we try first? ranking information resources through query classification

FQAS'11 Proceedings of the 9th international conference on Flexible Query Answering Systems
A feature-free search query classification approach using semantic distance

Expert Systems with Applications: An International Journal
SEReleC# - C# implementation of SEReleC: a meta search engine based on combinatorial search and search keyword based link classification

Proceedings of the CUBE International Information Technology Conference
Geographic Information Retrieval and Text Mining on Chinese Tourism Web Pages

International Journal of Information Technology and Web Engineering

Quantified Score

Hi-index	0.01

Visualization

Abstract

The performance of search engines crucially depends on their ability to capture the meaning of a query most likely intended by the user. We study the problem of mapping a search engine query to those nodes of a given subject taxonomy that characterize its most likely meanings. We describe the architecture of a classification system that uses a web directory to identify the subject context that the query terms are frequently used in. Based on its performance on the classification of 800,000 example queries recorded from MSN search, the system received the Runner-Up Award for Query Categorization Performance of the KDD Cup 2005.