Question answering system for incomplete and noisy data: methods and measures for its evaluation

  • Authors:
  • Lili Aunimo;Oskari Heinonen;Reeta Kuuskoski;Juha Makkonen;Renaud Petit;Otso Virtanen

  • Affiliations:
  • University of Helsinki, Department of Computer Science, University of Helsinki, Finland;University of Helsinki, Department of Computer Science, University of Helsinki, Finland;University of Helsinki, Department of Computer Science, University of Helsinki, Finland;University of Helsinki, Department of Computer Science, University of Helsinki, Finland;University of Helsinki, Department of Computer Science, University of Helsinki, Finland;University of Helsinki, Department of Computer Science, University of Helsinki, Finland

  • Venue:
  • ECIR'03 Proceedings of the 25th European conference on IR research
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a question answering system that can handle noisy and incomplete natural language data, and methods and measures for the evaluation of question answering systems. Our question answering system is based on the vector space model and linguistic analysis of the natural language data. In the evaluation procedure, we test eight different preprocessing schemes for the data, and come to the conclusion that lemmatization combined with breaking compound words into their constituents gives significantly better results than the baseline. The evaluation process is based on stratified random sampling and bootstrapping. To measure the correctness of an answer, we use partial credits as well as full credits.