Enhancement of a Chinese discourse marker tagger with C4.5

  • Authors:
  • Benjamin K. T'sou; Tom B. Y. Lai; Samuel W. K. Chan; Weijun Gao; Xuegang Zhan

  • Affiliations:
  • City University of Hong Kong, Kowloon, Hong Kong SAR, China; City University of Hong Kong, Kowloon, Hong Kong SAR, China; City University of Hong Kong, Kowloon, Hong Kong SAR, China; Northeastern University, China; Northeastern University, China

  • Venue:
  • CLPW '00 Proceedings of the second workshop on Chinese language processing: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 12
  • Year:
  • 2000

Abstract

Discourse markers are complex discontinuous linguistic expressions that explicitly signal the discourse structure of a text. This paper describes efforts to improve an automatic tagging system that identifies and classifies discourse markers in Chinese texts. Machine learning (ML), here the C4.5 learner named in the title, is applied to the disambiguation of discourse markers, as an integral part of automatic text summarization via rhetorical structure. Encouraging results are reported.