Semantic chunk annotation for complex questions using conditional random field

  • Authors:
  • Shixi Fan;Wing W. Y. Ng;Xiaolong Wang;Yaoyun Zhang;Xuan Wang

  • Affiliations:
  • Harbin Institute of Technology, Shenzhen, China;Harbin Institute of Technology, Shenzhen, China;Harbin Institute of Technology, Shenzhen, China;Harbin Institute of Technology, Shenzhen, China;Harbin Institute of Technology, Shenzhen, China

  • Venue:
  • KRAQ '08 Coling 2008: Proceedings of the workshop on Knowledge and Reasoning for Answering Questions
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a CRF (Conditional Random Field) model for Semantic Chunk Annotation in a Chinese Question and Answering System (SCACQA). The model was derived from a corpus of real world questions, which are collected from some discussion groups on the Internet. The questions are supposed to be answered by other people, so some of the questions are very complex. Mutual information was adopted for feature selection. The training data collection consists of 14000 sentences and the testing data collection consists of 4000 sentences. The result shows an F-score of 93.07%.