Applying Machine Learning to Identify Chinese Discourse Markers

  • Authors:
  • Benjamin K. Tsou

  • Venue:
  • ICIIS '99 Proceedings of the 1999 International Conference on Information Intelligence and Systems
  • Year:
  • 1999

Abstract

Because discourse markers occur frequently in argumentative Chinese texts, they play a significant role in the automatic processing of such texts, for example in automatic summarization. This paper reports on an effort to apply machine learning to the identification of discourse markers in Chinese. We processed 80 Chinese texts and selected subsets of them for training and testing. Using C4.5 in our experiments, we obtained an accuracy on the order of 80%. The accuracy obtained with a neural network was slightly lower than that of C4.5. We also interpret and analyze our experimental results from a linguistic perspective.
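
As a rough illustration of how a C4.5-style learner chooses which feature to split on, the following minimal sketch computes the gain ratio, the split criterion C4.5 is known for. The feature names (`clause_initial`, `followed_by_comma`), the labels, and the four toy rows are hypothetical and are not taken from the paper; the abstract does not state which features or data the author used.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain_ratio(rows, labels, feature):
    """Information gain of splitting on `feature`, normalised by split information."""
    total = len(rows)
    # Group the class labels by the value the feature takes in each row.
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature], []).append(label)
    # Weighted entropy remaining after the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    # Split information penalises features with many distinct values.
    split_info = -sum((len(g) / total) * log2(len(g) / total) for g in groups.values())
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical toy data: each row describes one candidate word occurrence.
rows = [
    {"clause_initial": True, "followed_by_comma": True},
    {"clause_initial": True, "followed_by_comma": False},
    {"clause_initial": False, "followed_by_comma": True},
    {"clause_initial": False, "followed_by_comma": False},
]
labels = ["marker", "marker", "other", "other"]

# Pick the feature with the highest gain ratio as the first split.
best = max(rows[0], key=lambda f: gain_ratio(rows, labels, f))
print(best)
```

In this toy set the hypothetical `clause_initial` feature separates the two classes perfectly, so it is selected as the first split. A full decision-tree learner would repeat this selection recursively on each resulting subset and then prune the tree.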