A language independent method for question classification

  • Authors:
  • Thamar Solorio;Manuel Pérez-Coutiño;Manuel Montes-y-Gémez;Luis Villaseñor-Pineda;Aurelio López-López

  • Affiliations:
  • National Institute of Astrophysics, Tonantzintla, Puebla, Mexico;National Institute of Astrophysics, Tonantzintla, Puebla, Mexico;National Institute of Astrophysics, Tonantzintla, Puebla, Mexico and Universidad Politécnica de Valencia, España;National Institute of Astrophysics, Tonantzintla, Puebla, Mexico;National Institute of Astrophysics, Tonantzintla, Puebla, Mexico

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Previous works on question classification are based on complex natural language processing techniques: named entity extractors, parsers, chunkers, etc. While these approaches have proven to be effective they have the disadvantage of being targeted to a particular language. We present here a simple approach that exploits lexical features and the Internet to train a classifier, namely a Support Vector Machine. The main feature of this method is that it can be applied to different languages without requiring major modifications. Experimental results of this method on English, Italian and Spanish show that this approach can be a practical tool for question answering systems, reaching a classification accuracy as high as 88.92%.