Building a large annotated corpus of English: the Penn Treebank
Computational Linguistics - Special issue on using large corpora: II
In this paper, we propose a sentence segmentation model for a semi-automatic tree annotation tool that uses a parsing model. To improve both parsing performance and parsing complexity without modifying the parsing model, the tool first segments a sentence and then performs two-phase parsing: one phase for the intra-structure of each segment and one for the inter-structure of the segments. Experimental results show that the proposed sentence segmentation model reduces manual effort by about 28.3%, because annotator interventions related to cancellation and reconstruction decrease markedly.
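
The segment-then-parse pipeline described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' method: the `segment` function splits on punctuation tokens instead of using a learned segmentation model, and `flat_parser` is a hypothetical stand-in for the parsing model, which the abstract says is reused without modification. The sketch only shows how the same kind of parser can be applied first inside each segment and then across the resulting segment subtrees.

```python
from typing import Callable, List

# A tree is either a token (str) or a (label, children) tuple.

def segment(tokens: List[str], boundaries=frozenset({",", ";"})) -> List[List[str]]:
    """Hypothetical segmenter: close a segment after each boundary token.

    The paper proposes a learned segmentation model; this rule is a placeholder.
    """
    segments, current = [], []
    for tok in tokens:
        current.append(tok)
        if tok in boundaries:
            segments.append(current)
            current = []
    if current:  # keep the trailing segment if the sentence does not end on a boundary
        segments.append(current)
    return segments

def flat_parser(label: str) -> Callable[[list], tuple]:
    """Hypothetical stand-in parser: wraps its input under a single labeled node."""
    def parse(items: list) -> tuple:
        return (label, list(items))
    return parse

def two_phase_parse(tokens: List[str],
                    parse_intra: Callable[[list], tuple],
                    parse_inter: Callable[[list], tuple]) -> tuple:
    """Phase 1 parses each segment; phase 2 combines the segment subtrees."""
    subtrees = [parse_intra(seg) for seg in segment(tokens)]  # intra-structure
    return parse_inter(subtrees)                              # inter-structure

if __name__ == "__main__":
    sentence = "the tool segments the sentence , then it parses each part".split()
    print(two_phase_parse(sentence, flat_parser("SEG"), flat_parser("S")))
```

In a real tool, both `parse_intra` and `parse_inter` would call the existing parsing model, and an annotator would correct each phase's output separately.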