TnT: a statistical part-of-speech tagger
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A divide-and-conquer strategy for shallow parsing of German free texts
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Annotating topological fields and chunks: and revising POS tags at the same time
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Topological field parsing of German
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Hi-index | 0.00 |
In this paper we compare three different approaches to the analysis of the basic structure in German sentences: the sentence brackets in the topological field framework in German (Höhle, 1986). The first approach is based on hand-written Finite-State Automata (FSA); the other two are trained on corpus data. One is a Probabilistic Context-Free Grammar (PCFG) approach, the other is a classification-based Memory-Based Learning (MBL) approach. The three approaches are evaluated on a manually annotated corpus. We will show that the Fβ=1 value for this task is around 94% for all three approaches, which suggests that this is a fruitful first step for parsing and analysing German text.