Anaphora in czech: large data and experiments with automatic anaphora resolution

  • Authors:
  • Lucie Kučová;Zdeněk Žabokrtský

  • Affiliations:
  • Institute of Formal and Applied Linguistics, Charles University (MFF), Prague, Czech Republic;Institute of Formal and Applied Linguistics, Charles University (MFF), Prague, Czech Republic

  • Venue:
  • TSD'05 Proceedings of the 8th international conference on Text, Speech and Dialogue
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The aim of this paper is two-fold. First, we want to present a part of the annotation scheme of the Prague Dependency Treebank 2.0 related to the annotation of coreference on the tectogrammatical layer of sentence representation (more than 45,000 textual and grammatical coreference links in almost 50,000 manually annotated Czech sentences). Second, we report a new pronoun resolution system developed and tested using the treebank data, the success rate of which is 60.4 %.