Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Reducing multiclass to binary: a unifying approach for margin classifiers
The Journal of Machine Learning Research
Nymble: a high-performance learning name-finder
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Features for unsupervised document classification
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Error-correcting output codes: a general method for improving multiclass inductive learning programs
AAAI'91 Proceedings of the ninth National conference on Artificial intelligence - Volume 2
Learning question classifiers: the role of semantic information
Natural Language Engineering
Enhanced answer type inference from questions using sequential models
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Multilingual Question Classification based on surface text features
Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Question classification using head words and their hypernyms
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Subtree mining for question classification problem
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Semantically Expanding Questions for Supervised Automatic Classification
FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Investigation of question classifier in question answering
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
BRUJA: question classification for Spanish. Using machine translation and an English classifier
MLQA '06 Proceedings of the Workshop on Multilingual Question Answering
Answering questions with an n-gram based passage retrieval engine
Journal of Intelligent Information Systems
A semantic approach for question classification using WordNet and Wikipedia
Pattern Recognition Letters
Question classification based on co-training style semi-supervised learning
Pattern Recognition Letters
Enhanced semantic expansion for question classification
International Journal of Internet Technology and Secured Transactions
Improving subtree-based question classification classifiers with word-cluster models
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Exploiting unlabeled data for question classification
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Question classification for a Croatian QA system
TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Using semi-supervised learning for question classification
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
A multilingual SVM-based question classification system
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
AI'10 Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence
Benchmarking data mining methods in CAT
ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
A support vector machine-based context-ranking model for question answering
Information Sciences: an International Journal
Learning regular expressions to template-based FAQ retrieval systems
Knowledge-Based Systems
Hi-index | 0.00 |
In this paper we consider a machine learning technique for question classification. The goal is to replace our regular expression based classifier with a classifier that learns from a set of labeled questions. We have realized that an enourmous amount of time is required to create a rich collection of patterns and keywords for a good coverage of questions in an open-domain application. We decided to use support vector machines, since they have been successfully used for a number of benchmark problems. Although the support vector machines are inherently binary classifiers, it is possible to extend their use as multi-class classifiers using binary codes. We represent questions as frequency weighted vectors of salient terms. We compare our approcah to related work that uses relatively complex syntactic/semantic processing to create features and a sparse network of linear units to classify questions. We provide results to show performance of the method.