Spell checking techniques for replacement of unknown words and data cleaning for Haitian Creole SMS translation

  • Authors:
  • Sara Stymne

  • Affiliations:
  • Linköping University, Sweden

  • Venue:
  • WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We report results on translation of SMS messages from Haitian Creole to English. We show improvements by applying spell checking techniques to unknown words and creating a lattice with the best known spelling equivalents. We also used a small cleaned corpus to train a cleaning model that we applied to the noisy corpora.