Topological field chunking for German

  • Authors:
  • Jorn Veenstra;Frank Henrik Müller;Tylman Ule

  • Affiliations:
  • Universität Tübingen;Universität Tübingen;Universität Tübingen

  • Venue:
  • COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we compare three different approaches to the analysis of the basic structure in German sentences: the sentence brackets in the topological field framework in German (Höhle, 1986). The first approach is based on hand-written Finite-State Automata (FSA); the other two are trained on corpus data. One is a Probabilistic Context-Free Grammar (PCFG) approach, the other is a classification-based Memory-Based Learning (MBL) approach. The three approaches are evaluated on a manually annotated corpus. We will show that the Fβ=1 value for this task is around 94% for all three approaches, which suggests that this is a fruitful first step for parsing and analysing German text.