Question classification with support vector machines and error correcting codes

Authors:
Kadri Hacioglu;Wayne Ward
Affiliations:
University of Colorado at Boulder;University of Colorado at Boulder
Venue:
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Year:
2003

Citing 7
Cited 22

Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Reducing multiclass to binary: a unifying approach for margin classifiers

The Journal of Machine Learning Research
Nymble: a high-performance learning name-finder

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Learning question classifiers

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Features for unsupervised document classification

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Error-correcting output codes: a general method for improving multiclass inductive learning programs

AAAI'91 Proceedings of the ninth National conference on Artificial intelligence - Volume 2

Learning question classifiers: the role of semantic information

Natural Language Engineering
Enhanced answer type inference from questions using sequential models

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Multilingual Question Classification based on surface text features

Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Question classification using head words and their hypernyms

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Subtree mining for question classification problem

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Semantically Expanding Questions for Supervised Automatic Classification

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Investigation of question classifier in question answering

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
BRUJA: question classification for Spanish. Using machine translation and an English classifier

MLQA '06 Proceedings of the Workshop on Multilingual Question Answering
Answering questions with an n-gram based passage retrieval engine

Journal of Intelligent Information Systems
A semantic approach for question classification using WordNet and Wikipedia

Pattern Recognition Letters
Question classification based on co-training style semi-supervised learning

Pattern Recognition Letters
Enhanced semantic expansion for question classification

International Journal of Internet Technology and Secured Transactions
Improving subtree-based question classification classifiers with word-cluster models

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Exploiting unlabeled data for question classification

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Question classification for a Croatian QA system

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Toward automated consumer question answering: Automatically separating consumer questions from professional questions in the healthcare domain

Journal of Biomedical Informatics
Using semi-supervised learning for question classification

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
A multilingual SVM-based question classification system

MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Improving multiclass text classification with error-correcting output coding and sub-class partitions

AI'10 Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence
Benchmarking data mining methods in CAT

ICIC'11 Proceedings of the 7th international conference on Advanced Intelligent Computing Theories and Applications: with aspects of artificial intelligence
A support vector machine-based context-ranking model for question answering

Information Sciences: an International Journal
Learning regular expressions to template-based FAQ retrieval systems

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we consider a machine learning technique for question classification. The goal is to replace our regular expression based classifier with a classifier that learns from a set of labeled questions. We have realized that an enourmous amount of time is required to create a rich collection of patterns and keywords for a good coverage of questions in an open-domain application. We decided to use support vector machines, since they have been successfully used for a number of benchmark problems. Although the support vector machines are inherently binary classifiers, it is possible to extend their use as multi-class classifiers using binary codes. We represent questions as frequency weighted vectors of salient terms. We compare our approcah to related work that uses relatively complex syntactic/semantic processing to create features and a sparse network of linear units to classify questions. We provide results to show performance of the method.