A parse-and-trim approach with information significance for Chinese sentence compression

  • Authors:
  • Wei Xu;Ralph Grishman

  • Affiliations:
  • New York University, New York, NY;New York University, New York, NY

  • Venue:
  • UCNLG+Sum '09 Proceedings of the 2009 Workshop on Language Generation and Summarisation
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose an event-based approach for Chinese sentence compression without using any training corpus. We enhance the linguistically-motivated heuristics by exploiting event word significance and event information density. This is shown to improve the preservation of important information and the tolerance of POS and parsing errors, which are more common in Chinese than English. The heuristics are only required to determine possibly removable constituents instead of selecting specific constituents for removal, and thus are easier to develop and port to other languages and domains. The experimental results show that around 72% of our automatic compressions are grammatically and semantically correct, preserving around 69% of the most important information on average.