Combining clues for lexical level aligning using the null hypothesis approach

  • Authors:
  • Olivier Kraif;Boxing Chen

  • Affiliations:
  • LIDILEM, Université Stendhal, Grenoble, France;LIDILEM, Université Stendhal, Grenoble, France

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Various informations can be used to align parallel texts at word level: co-occurrence frequencies, position difference, part-of-speech, graphic resemblance, etc. This paper proposes a simple method to combine these clues in an efficient way. The association score is computed from the probabilities of pairing two units under Null hypothesis, assuming that the association is fortuitous. This approach has been applied to a literary English-French parallel text with good results.