Learning question classifiers: the role of semantic information

  • Authors:
  • Xin Li;Dan Roth

  • Affiliations:
  • Department of Computer Science, University of Illinois at Urbana-Champaign, IL 61801, USA e-mail: xli1@uiuc.edu, danr@uiuc.edu;Department of Computer Science, University of Illinois at Urbana-Champaign, IL 61801, USA e-mail: xli1@uiuc.edu, danr@uiuc.edu

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

To respond correctly to a free form factual question given a large collection of text data, one needs to understand the question to a level that allows determining some of the constraints the question imposes on a possible answer. These constraints may include a semantic classification of the sought after answer and may even suggest using different strategies when looking for and verifying a candidate answer. This work presents a machine learning approach to question classification. Guided by a layered semantic hierarchy of answer types, we develop a hierarchical classifier that classifies questions into fine-grained classes. This work also performs a systematic study of the use of semantic information sources in natural language classification tasks. It is shown that, in the context of question classification, augmenting the input of the classifier with appropriate semantic category information results in significant improvements to classification accuracy. We show accurate results on a large collection of free-form questions used in TREC 10 and 11.