Natural language watermarking via morphosyntactic alterations

  • Authors:
  • Hasan Mesut Meral;Bülent Sankur;A. Sumru Özsoy;Tunga Güngör;Emre Sevinç

  • Affiliations:
  • Boğaziçi University, Linguistics Program, Bebek, İstanbul 34342, Turkey;Boğaziçi University, Department of Electrical and Electronic Engineering, Bebek, İstanbul 34342, Turkey;Boğaziçi University, Linguistics Program, Bebek, İstanbul 34342, Turkey and Boğaziçi University, Cognitive Science Program, Bebek, İstanbul 34342, Turkey;Boğaziçi University, Cognitive Science Program, Bebek, İstanbul 34342, Turkey and Boğaziçi University, Department of Computer Engineering, Bebek, İstanbul 34342, Turk ...;Boğaziçi University, Cognitive Science Program, Bebek, İstanbul 34342, Turkey

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We develop a morphosyntax-based natural language watermarking scheme. In this scheme, a text is first transformed into a syntactic tree diagram where the hierarchies and the functional dependencies are made explicit. The watermarking software then operates on the sentences in syntax tree format and executes binary changes under control of Wordnet and Dictionary to avoid semantic drops. A certain level of security is provided via key-controlled randomization of morphosyntactic tools and the insertion of void watermark. The security aspects and payload aspects are evaluated statistically while the imperceptibility is measured using edit-hit counts based on human judgments. It is observed that agglutinative languages are somewhat more amenable to morphosyntax-based natural language watermarking and the free word order property of a language, like Turkish, is an extra bonus.