Discursive usage of six Chinese punctuation marks

Authors:
Yue Ming
Affiliations:
Communication University of China, Beijing, China
Venue:
COLING ACL '06 Proceedings of the 21st International Conference on computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
Year:
2006

Citing 10
Cited 0

Applying Machine Learning to Identify Chinese Discourse Markers

ICIIS '99 Proceedings of the 1999 International Conference on Information Intelligence and Systems
The rhetorical parsing, summarization, and generation of natural language texts

The rhetorical parsing, summarization, and generation of natural language texts
The rhetorical parsing, summarization, and generation of natural language texts

The rhetorical parsing, summarization, and generation of natural language texts
Critical tokenization and its properties

Computational Linguistics
Chinese word segmentation without using lexicon and hand-crafted training data

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Tokenization as the initial phase in NLP

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 4
Building a discourse-tagged corpus in the framework of Rhetorical Structure Theory

SIGDIAL '01 Proceedings of the Second SIGdial Workshop on Discourse and Dialogue - Volume 16
Chinese word segmentation in MSR-NLP

SIGHAN '03 Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
Mining discourse markers for Chinese textual summarization

NAACL-ANLP-AutoSum '00 Proceedings of the 2000 NAACL-ANLP Workshop on Automatic Summarization
The Potsdam commentary corpus

DiscAnnotation '04 Proceedings of the 2004 ACL Workshop on Discourse Annotation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Both rhetorical structure and punctuation have been helpful in discourse processing. Based on a corpus annotation project, this paper reports the discursive usage of 6 Chinese punctuation marks in news commentary texts: Colon, Dash, Ellipsis, Exclamation Mark, Question Mark, and Semicolon. The rhetorical patterns of these marks are compared against patterns around cue phrases in general. Results show that these Chinese punctuation marks, though fewer in number than cue phrases, are easy to identify, have strong correlation with certain relations, and can be used as distinctive indicators of nuclearity in Chinese texts.