Question classification for a Croatian QA system

Authors:
Tomislav Lombarović;Jan Šnajder;Bojana Dalbelo Bašić
Affiliations:
Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia;Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia;Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia
Venue:
TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Year:
2011

Citing 15
Cited 0

Scaling question answering to the web

ACM Transactions on Information Systems (TOIS)
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
The TREC question answering track

Natural Language Engineering
Analysis of Statistical Question Classification for Fact-Based Questions

Information Retrieval
Learning question classifiers

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Question classification with support vector machines and error correcting codes

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Language morphology offset: Text classification on a Croatian-English parallel corpus

Information Processing and Management: an International Journal
Automatic acquisition of inflectional lexica for morphological normalisation

Information Processing and Management: an International Journal
Baseball: an automatic question-answerer

IRE-AIEE-ACM '61 (Western) Papers presented at the May 9-11, 1961, western joint IRE-AIEE-ACM computer conference
Progress in natural language understanding: an application to lunar geology

AFIPS '73 Proceedings of the June 4-8, 1973, national computer conference and exposition
Developing a question answering system for the slovene language

WSEAS Transactions on Information Science and Applications
Language model based query classification

ECIR'07 Proceedings of the 29th European conference on IR research
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
BulQA: Bulgarian–bulgarian question answering at CLEF 2005

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Overview of the CLEF 2004 multilingual question answering track

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Quantified Score

Hi-index	0.00

Visualization

Abstract

Question Answering (QA) systems provide efficient means for retrieval of information, which in many cases more directly address users' information needs. The performance of a QA system crucially depends on its ability to correctly classify the query question according to the expected answer type. This paper addresses the problem of a question classification for the Croatian language, as a first step towards building an open-domain QA system. We compare different machine learning classifiers on a Croatian test collection based on a two-level question taxonomy. The evaluation results are encouraging and comparable to state-of-the-art results for other languages: accuracy is over 80% for coarse-grained classification and almost 70% for fine-grained classification.