Detecting semantic equivalence and information disparity in cross-lingual documents

  • Authors:
  • Yashar Mehdad;Matteo Negri;Marcello Federico

  • Affiliations:
  • Fondazione Bruno Kessler, FBK-irst, Trento, Italy;Fondazione Bruno Kessler, FBK-irst, Trento, Italy;Fondazione Bruno Kessler, FBK-irst, Trento, Italy

  • Venue:
  • ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We address a core aspect of the multilingual content synchronization task: the identification of novel, more informative or semantically equivalent pieces of information in two documents about the same topic. This can be seen as an application-oriented variant of textual entailment recognition where: i) T and H are in different languages, and ii) entailment relations between T and H have to be checked in both directions. Using a combination of lexical, syntactic, and semantic features to train a cross-lingual textual entailment system, we report promising results on different datasets.