Multilingual annotation and disambiguation of discourse connectives for machine translation

  • Authors:
  • Thomas Meyer;Andrei Popescu-Belis;Sandrine Zufferey;Bruno Cartoni

  • Affiliations:
  • Idiap Research Institute, Rue Marconi, Martigny, Switzerland;Idiap Research Institute, Rue Marconi, Martigny, Switzerland;University of Geneva, Geneva, Switzerland;University of Geneva, Geneva, Switzerland

  • Venue:
  • SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many discourse connectives can signal several types of relations between sentences. Their automatic disambiguation, i.e. the labeling of the correct sense of each occurrence, is important for discourse parsing, but could also be helpful to machine translation. We describe new approaches for improving the accuracy of manual annotation of three discourse connectives (two English, one French) by using parallel corpora. An appropriate set of labels for each connective can be found using information from their translations. Our results for automatic disambiguation are state-of-the-art, at up to 85% accuracy using surface features. Using feature analysis, contextual features are shown to be useful across languages and connectives.