Question pre-processing in a QA system on Internet discussion groups

  • Authors:
  • Chuan-Jie Lin;Chun-Hung Cho

  • Affiliations:
  • National Taiwan Ocean University, Keelung, Taiwan, R.O.C;National Taiwan Ocean University, Keelung, Taiwan, R.O.C

  • Venue:
  • SumQA '06 Proceedings of the Workshop on Task-Focused Summarization and Question Answering
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes methods to pre-process questions in the postings before a QA system can find answers in a discussion group in the Internet. Pre-processing includes garbage text removal and question segmentation. Garbage keywords are collected and different length thresholds are assigned to them for garbage text identification. Interrogative forms and question types are used to segment questions. The best performance on the test set achieves 92.57% accuracy in garbage text removal and 85.87% accuracy in question segmentation, respectively.