An efficient syntactic tagging tool for corpora

  • Authors:
  • Ming Zhou;Changning Huang

  • Affiliations:
  • Tsinghua University, Beijing, China;Tsinghua University, Beijing, China

  • Venue:
  • COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
  • Year:
  • 1994

Quantified Score

Hi-index 0.00

Visualization

Abstract

The tree bank is an important resoures for MT and linguistics researches, but it requires that large number of sentences be annotated with syntactic information. It is time consuming and troublesome, and difficult to keep consistency, if annotation is done manually. In this paper, we presented a new technique for the semi-automatic tagging of Chinese text. The system takes as input Chinese text, and outputs the syntactically tagged sentence(dependency tree). We use dependency grammar and employ a stack based shift/ reduce context-dependent parser as the tagging mechanism. The system works in human-machine cooperative way, in which the machine can acquire tagging rules from human intervention. The automation level can be improved step by step by accumulating rules during annotation. In addition, good consistency of tagging is guaranteed.