A simple hybrid aligner for generating lexical correspondences in parallel texts

  • Authors:
  • Lars Ahrenberg;Mikael Andersson;Magnus Merkel

  • Affiliations:
  • Linköping University, Linköping, Sweden;Linköping University, Linköping, Sweden;Linköping University, Linköping, Sweden

  • Venue:
  • COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an algorithm for bilingual word alignment that extends previous work by treating multi-word candidates on a par with single words, and combining some simple assumptions about the translation process to capture alignments for low frequency words. As most other alignment algorithms it uses cooccurrence statistics as a basis, but differs in the assumptions it makes about the translation process. The algorithm has been implemented in a modular system that allows the user to experiment with different combinations and variants of these assumptions. We give performance results from two evaluations, which compare will with results reported in the literature.