Linguistically-based sub-sentential alignment for terminology extraction from a bilingual automotive corpus

  • Authors:
  • Lieve Macken;Els Lefever;Veronique Hoste

  • Affiliations:
  • Ghent University College, Belgium;Ghent University College, Belgium;Ghent University College, Belgium

  • Venue:
  • COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a sub-sentential alignment system that links linguistically motivated phrases in parallel texts based on lexical correspondences and syntactic similarity. We compare the performance of our sub-sentential alignment system with different symmetrization heuristics that combine the GIZA++ alignments of both translation directions. We demonstrate that the aligned linguistically motivated phrases are a useful means to extract bilingual terminology and more specifically complex multiword terms.