Watermarking the outputs of structured prediction with an application in statistical machine translation

  • Authors: Ashish Venugopal, Jakob Uszkoreit, David Talbot, Franz J. Och, Juri Ganitkevitch

  • Affiliations: Google, Inc., Amphitheatre Parkway, Mountain View, CA (Venugopal, Uszkoreit, Talbot, Och); Johns Hopkins University, Baltimore, MD (Ganitkevitch)

  • Venue: EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year: 2011

Abstract

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and provides well-defined trade-offs between the ability to identify algorithm outputs and the quality of the watermarked output. Unlike previous work in the field, our approach does not rely on controlling the inputs to the algorithm and provides probabilistic guarantees on the ability to identify collections of results from one's own algorithm. We present an application in statistical machine translation, where machine-translated output is watermarked at minimal loss in translation quality and detected with high recall.
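
The abstract does not spell out the mechanism, so the following is only a minimal sketch of one way such a scheme could work, not the authors' implementation. It assumes that each candidate output is mapped to a pseudo-random bit by a hash, that the system prefers candidates whose bit is 1 as long as the quality loss stays within a budget, and that a collection of outputs is identified with a one-sided binomial test. The names `hash_bit`, `choose_watermarked`, `p_value`, and the parameter `max_loss` are hypothetical. Because the sketch hashes whole outputs, it does not reproduce the robustness to local edits that the abstract claims.

```python
import hashlib
from math import comb


def hash_bit(text: str) -> int:
    """Map a text to a pseudo-random bit (assumption: lowest bit of the first SHA-256 byte)."""
    return hashlib.sha256(text.encode("utf-8")).digest()[0] & 1


def choose_watermarked(candidates, max_loss):
    """Pick an output from (text, score) pairs, where a higher score is better.

    Only candidates within `max_loss` of the best score are considered.
    Among those, the best-scoring candidate whose hash bit is 1 is returned;
    if none has bit 1, the best-scoring candidate is returned unchanged.
    """
    best_score = max(score for _, score in candidates)
    allowed = [(t, s) for t, s in candidates if best_score - s <= max_loss]
    marked = [(t, s) for t, s in allowed if hash_bit(t) == 1]
    pool = marked if marked else allowed
    return max(pool, key=lambda c: c[1])[0]


def p_value(texts):
    """One-sided binomial p-value for a collection of outputs.

    Under the null hypothesis (outputs were not watermarked), each hash bit
    is 1 with probability 0.5, so the count of 1 bits is Binomial(n, 0.5).
    Returns P(X >= observed count).
    """
    n = len(texts)
    k = sum(hash_bit(t) for t in texts)
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
```

In this sketch, `max_loss` is the knob for the trade-off the abstract mentions: a larger budget lets more outputs carry the mark, which makes collections easier to identify but permits lower-scoring outputs. A small `p_value` over a collection suggests that the collection came from the watermarking system.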