A model for fine-grained alignment of multilingual texts

Authors:
Lea Cyrus;Hendrik Feddes
Affiliations:
University of Münster, Müster, Germany;University of Münster, Müster, Germany
Venue:
MLR '04 Proceedings of the Workshop on Multilingual Linguistic Ressources
Year:
2004

Citing 1
Cited 0

The Penn Treebank: annotating predicate argument structure

HLT '94 Proceedings of the workshop on Human Language Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level inbetween are a useful resource for many purposes. Starting from a number of examples of non-literal translations, which tend to make alignment difficult, we describe an alignment model which copes with these cases by explicitly coding them. The model is based on predicateargument structures and thus covers the middle ground between sentence and word alignment. The model is currently used in a recently initiated project of a parallel English-German treebank (FuSe), which can in principle be extended with additional languages.