Comparing phrase-based and syntax-based paraphrase generation

  • Authors:
  • Sander Wubben;Erwin Marsi;Antal van den Bosch;Emiel Krahmer

  • Affiliations:
  • Tilburg University, LE Tilburg, The Netherlands;NTNU, Sem Saelandsvei, Trondheim, Norway;Tilburg University, LE Tilburg, The Netherlands;Tilburg University, LE Tilburg, The Netherlands

  • Venue:
  • MTTG '11 Proceedings of the Workshop on Monolingual Text-To-Text Generation
  • Year:
  • 2011

Abstract

Paraphrase generation can be regarded as machine translation in which the source and target language are the same. We use the Moses statistical machine translation toolkit for paraphrasing, comparing phrase-based with syntax-based approaches. Data is derived from a recently released, large-scale (2.1M tokens) paraphrase corpus for Dutch. Preliminary results indicate that the phrase-based approach performs better in terms of NIST scores and produces paraphrases at a greater distance from the source.
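The abstract does not say how "distance from the source" is measured. As a purely illustrative sketch, one common choice is the word-level Levenshtein (edit) distance between the source sentence and the generated paraphrase. The function name `token_edit_distance` and the whitespace tokenization are assumptions made for this example, not the authors' method.

```python
def token_edit_distance(source: str, paraphrase: str) -> int:
    """Word-level Levenshtein distance between two sentences.

    Assumption: tokens are obtained by whitespace splitting; the
    paper may use a different tokenizer or a different measure.
    """
    src = source.split()
    tgt = paraphrase.split()

    # prev[j] holds the distance between the first i-1 source tokens
    # and the first j target tokens.
    prev = list(range(len(tgt) + 1))
    for i, s in enumerate(src, start=1):
        curr = [i]  # deleting all i source tokens
        for j, t in enumerate(tgt, start=1):
            cost = 0 if s == t else 1
            curr.append(min(
                prev[j] + 1,         # delete a source token
                curr[j - 1] + 1,     # insert a target token
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]
```

Under this measure, a paraphrase identical to its source has distance 0, and larger values indicate output that differs more from the input.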