Discursive usage of six Chinese punctuation marks
COLING ACL '06 Proceedings of the 21st International Conference on computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
Hi-index | 0.00 |
With their high occurrence rates in argumentative Chinese texts, discourse markers play a significant role in the automatic processing of these kinds of Chinese texts, such as automatic summarization. This paper reports on an effort in applying machine learning to identify discourse markers in Chinese. We have processed 80 Chinese texts from which we have selected subsets for data training and data testing. We used C4.5 in our experiments and obtained accuracy of the order of 80%. Accuracy obtained by neural network are a bit worse than that of C4.5. We also interpret and analyze our experimental results in the linguistic perspective.