A New Model for Chinese Short-text Classification Considering Feature Extension

  • Authors:
  • Xinghua Fan;Hongge Hu

  • Affiliations:
  • -;-

  • Venue:
  • AICI '10 Proceedings of the 2010 International Conference on Artificial Intelligence and Computational Intelligence - Volume 02
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

this paper presents a new model for classifying Chinese short-text that have weak concept signal, in which three key factors on feature extension, which would determine the classification performance of short-text, are considered. For the sake of determining the three extension factors, this paper studied the three key issues as follows: (1) how we do feature extension for short-text; (2) what influence the different ways of feature extension do to classification performance of short-text; (3) how we control the degree of feature extension for short text. In the stage of classification, a short-text is first extended by adding new features or modifying the weights of initial features according to the relationship between non-feature terms and feature extension mode; meanwhile, we would improve the effect of feature extension by controlling the degree of feature extension, and then classify the extended short-text with the new model. The experimental results show that the new model proposed for short-text classification considering feature extension can obtain higher classification performance comparing with the conventional classification methods.