Query enrichment for web-query classification

Authors:
Dou Shen;Rong Pan;Jian-Tao Sun;Jeffrey Junfeng Pan;Kangheng Wu;Jie Yin;Qiang Yang
Affiliations:
Hong Kong University of Science and Technology, Hong Kong, China;Hong Kong University of Science and Technology, Hong Kong, China;Microsoft Research Asia, Beijing, China;Hong Kong University of Science and Technology, Hong Kong, China;Hong Kong University of Science and Technology, Hong Kong, China;Hong Kong University of Science and Technology, Hong Kong, China;Hong Kong University of Science and Technology, Hong Kong, China
Venue:
ACM Transactions on Information Systems (TOIS)
Year:
2006

Abstract

Web-search queries are typically short and ambiguous. Classifying these queries into target categories is a difficult but important problem. In this article, we present a new technique called query enrichment, which takes a short query and maps it to intermediate objects. Based on the collected intermediate objects, the query is then mapped to target categories. To build the necessary mapping functions, we use an ensemble of search engines to produce an enrichment of the queries. Our technique was applied to the ACM Knowledge Discovery and Data Mining competition (ACM KDDCUP) in 2005, where we won the championship on all three evaluation metrics (precision; the F1 measure, which combines precision and recall; and creativity, which is judged by the organizers) among a total of 33 teams worldwide. In this article, we show that, despite an abundance of ambiguous queries and a lack of training data, our query-enrichment technique can solve the problem satisfactorily through a two-phase classification framework. We present a detailed description of our algorithm and its experimental evaluation. Our best results for F1 and precision are 42.4% and 44.4%, respectively, which are 9.6% and 24.3% higher than those of the runners-up.
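
The two-phase idea described in the abstract (query to intermediate objects, then intermediate objects to target categories) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the stub "search engines", the category names, the lookup table `INTERMEDIATE_TO_TARGET`, and the simple vote counting are all hypothetical stand-ins chosen only to show the data flow.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical stand-in for a search engine: maps a query to the
# intermediate categories of the results it returns. A real system would
# call actual engines; these lookup tables are illustrative only.
SearchEngine = Callable[[str], List[str]]


def engine_a(query: str) -> List[str]:
    table = {"apple": ["Computers/Hardware", "Food/Fruit", "Computers/Hardware"]}
    return table.get(query, [])


def engine_b(query: str) -> List[str]:
    table = {"apple": ["Computers/Hardware", "Food/Fruit", "Shopping/Electronics"]}
    return table.get(query, [])


# Hypothetical mapping from intermediate categories to target categories.
INTERMEDIATE_TO_TARGET: Dict[str, str] = {
    "Computers/Hardware": "Computers",
    "Food/Fruit": "Living",
    "Shopping/Electronics": "Shopping",
}


def enrich(query: str, engines: List[SearchEngine]) -> List[str]:
    """Phase 1: collect intermediate objects for the query from every engine."""
    intermediate: List[str] = []
    for engine in engines:
        intermediate.extend(engine(query))
    return intermediate


def classify(query: str, engines: List[SearchEngine], top_k: int = 2) -> List[str]:
    """Phase 2: map intermediate objects to target categories and count votes."""
    votes: Counter = Counter()
    for obj in enrich(query, engines):
        target = INTERMEDIATE_TO_TARGET.get(obj)
        if target is not None:  # skip intermediate objects with no known mapping
            votes[target] += 1
    return [category for category, _ in votes.most_common(top_k)]


if __name__ == "__main__":
    print(classify("apple", [engine_a, engine_b]))
```

Pooling the results of several engines before voting is the simplest way to combine them; the abstract only states that an ensemble of search engines is used, so the combination rule shown here is an assumption.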