A sequential algorithm for training text classifiers
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Machine Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
The application of AdaBoost for distributed, scalable and on-line learning
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Bringing order to the Web: automatically categorizing search results
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Agglomerative clustering of a search engine query log
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Information Retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Query type classification for web document retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Ensemble selection from libraries of models
ICML '04 Proceedings of the twenty-first international conference on Machine learning
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Building bridges for web query classification
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Query enrichment for web-query classification
ACM Transactions on Information Systems (TOIS)
Coupling feature selection and machine learning methods for navigational query identification
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Automatic classification of Web queries using very large unlabeled query logs
ACM Transactions on Information Systems (TOIS)
ACM SIGKDD Explorations Newsletter
A large-scale evaluation and analysis of personalized search strategies
Proceedings of the 16th international conference on World Wide Web
Identifying ambiguous queries in web search
Proceedings of the 16th international conference on World Wide Web
Robust classification of rare queries using web knowledge
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Noise reduction through summarization for Web-page classification
Information Processing and Management: an International Journal
Using the wisdom of the crowds for keyword generation
Proceedings of the 17th international conference on World Wide Web
Search advertising using web relevance feedback
Proceedings of the 17th ACM conference on Information and knowledge management
Analysis of varying approaches to topical web query classification
Proceedings of the 3rd international conference on Scalable information systems
Identification of ambiguous queries in web search
Information Processing and Management: an International Journal
Classifying search queries using the Web as a source of knowledge
ACM Transactions on the Web (TWEB)
Unsupervised query categorization using automatically-built concept graphs
Proceedings of the 18th international conference on World wide web
Understanding user's query intent with wikipedia
Proceedings of the 18th international conference on World wide web
Sources of evidence for vertical selection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Lycos Retriever: an information fusion engine
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Query Classification Based on Regularized Correlated Topic Model
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
PQC: personalized query classification
Proceedings of the 18th ACM conference on Information and knowledge management
Classification-based resource selection
Proceedings of the 18th ACM conference on Information and knowledge management
Phrase clustering for discriminative learning
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Domain Specific Opinion Retrieval
AIRS '09 Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology
PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Fuzzy clustering based ad recommendation for TV programs
EuroITV'07 Proceedings of the 5th European conference on Interactive TV: a shared experience
Classification-enhanced ranking
Proceedings of the 19th international conference on World wide web
Ranking using multi-features in blog search
PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
Mining Query Logs: Turning Search Usage Data into Knowledge
Foundations and Trends in Information Retrieval
A large-scale study on map search logs
ACM Transactions on the Web (TWEB)
Context-aware ranking in web search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Mining Historic Query Trails to Label Long and Rare Search Engine Queries
ACM Transactions on the Web (TWEB)
Learning recurrent event queries for web search
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Multidimensional mining of large-scale search logs: a topic-concept cube approach
Proceedings of the fourth ACM international conference on Web search and data mining
Multi-dimensional search result diversification
Proceedings of the fourth ACM international conference on Web search and data mining
Learning query ambiguity models by using search logs
Journal of Computer Science and Technology
Query classification using Wikipedia
International Journal of Intelligent Information and Database Systems
Exploring wikipedia's category graph for query classification
AIS'11 Proceedings of the Second international conference on Autonomous and intelligent systems
Query classification based on index association rule expansion
WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
Query-feature graphs: bridging user vocabulary and system functionality
Proceedings of the 24th annual ACM symposium on User interface software and technology
Learning to rank categories for web queries
Proceedings of the 20th ACM international conference on Information and knowledge management
Which should we try first? ranking information resources through query classification
FQAS'11 Proceedings of the 9th international conference on Flexible Query Answering Systems
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Evaluating the effectiveness of search task trails
Proceedings of the 21st international conference on World Wide Web
A unified search federation system based on online user feedback
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
"Piaf" vs "Adele": classifying encyclopedic queries using automatically labeled training data
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Predicting event-relatedness of popular queries
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
In this paper, we describe our ensemble-search based approach, Q2C@UST (http://webprojectl.cs.ust.hk/q2c/), for the query classification task for the KDDCUP 2005. There are two aspects to the key difficulties of this problem: one is that the meaning of the queries and the semantics of the predefined categories are hard to determine. The other is that there are no training data for this classification problem. We apply a two-phase framework to tackle the above difficulties. Phase I corresponds to the training phase of machine learning research and phase II corresponds to testing phase. In phase I, two kinds of classifiers are developed as the base classifiers. One is synonym-based and the other is statistics based. Phase II consists of two stages. In the first stage, the queries are enriched such that for each query, its related Web pages together with their category information are collected through the use of search engines. In the second stage, the enriched queries are classified through the base classifiers trained in phase I. Based on the classification results obtained by the base classifiers, two ensemble classifiers based on two different strategies are proposed. The experimental results on the validation dataset help confirm our conjectures on the performance of the Q2C@UST system. In addition, the evaluation results given by the KDDCUP 2005 organizer confirm the effectiveness of our proposed approaches. The best F1 value of our two solutions is 9.6% higher than the best of all other participants' solutions. The average F1 value of our two submitted solutions is 94.4% higher than the average F1 value from all other submitted solutions.