Accurate topical classification of user queries allows for increased effectiveness and efficiency in general-purpose web search systems. Such classification becomes critical if the system is to return results not just from a general web collection but from topic-specific back-end databases as well. Maintaining sufficient classification recall is very difficult because web queries are typically short, yielding few features per query. This feature sparseness, coupled with the high query volumes typical of a large-scale search service, makes manual and supervised learning approaches alone insufficient. We apply a technique from computational linguistics to mine the vast amount of unlabeled data in web query logs and thereby improve automatic topical web query classification. We show that our approach, in combination with manual matching and supervised learning, allows us to classify a substantially larger proportion of queries than any single technique. We examine the performance of each approach on a real web query stream and show that our combined method accurately classifies 46% of queries, outperforming the recall of the best single approach by nearly 20%, with a 7% improvement in overall effectiveness.
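To make the idea of combining techniques concrete, the following is a minimal Python sketch of one plausible combination scheme: a precedence cascade in which each classifier may abstain and the first one to produce a label wins. The abstract does not say how the three approaches are actually combined, so the cascade order, the function names, and all of the toy data below are assumptions made for illustration only, not the authors' method.

```python
from typing import Callable, Optional, Sequence

# A classifier maps a query string to a topic label, or None when it abstains.
Classifier = Callable[[str], Optional[str]]

# Hypothetical toy data, invented for this illustration only.
MANUAL_LABELS = {"nfl scores": "sports", "cheap flights": "travel"}
KEYWORD_TOPICS = {"recipe": "food", "hotel": "travel"}
MINED_RULES = {"lyrics": "music", "symptoms": "health"}


def manual_match(query: str) -> Optional[str]:
    """Exact lookup in a hand-labeled query list."""
    return MANUAL_LABELS.get(query)


def supervised_stub(query: str) -> Optional[str]:
    """Stand-in for a trained classifier: keyword lookup, else abstain."""
    for token in query.split():
        if token in KEYWORD_TOPICS:
            return KEYWORD_TOPICS[token]
    return None


def mined_rule_match(query: str) -> Optional[str]:
    """Stand-in for rules mined from unlabeled logs: label by the final term."""
    tokens = query.split()
    return MINED_RULES.get(tokens[-1]) if tokens else None


def classify(query: str, stages: Sequence[Classifier]) -> Optional[str]:
    """Return the label from the first stage that does not abstain."""
    for stage in stages:
        label = stage(query)
        if label is not None:
            return label
    return None


def coverage(queries: Sequence[str], stages: Sequence[Classifier]) -> float:
    """Fraction of queries that receive any label (ignores correctness)."""
    labeled = sum(1 for q in queries if classify(q, stages) is not None)
    return labeled / len(queries)


if __name__ == "__main__":
    stream = ["nfl scores", "chicken recipe", "hey jude lyrics", "asdf"]
    print(coverage(stream, [manual_match]))
    print(coverage(stream, [manual_match, supervised_stub, mined_rule_match]))
```

On this toy stream, each stage labels a different query, so the cascade covers more of the stream than any single stage does. The sketch measures only how many queries receive a label; evaluating whether those labels are correct would require ground-truth topics, which are not modeled here.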