Natural Language Watermarking and Tamperproofing

  • Authors:
  • Mikhail J. Atallah;Victor Raskin;Christian Hempelmann;Mercan Karahan;Radu Sion;Umut Topkara;Katrina E. Triezenberg

  • Affiliations:
  • -;-;-;-;-;-;-

  • Venue:
  • IH '02 Revised Papers from the 5th International Workshop on Information Hiding
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Two main results in the area of information hiding in natural language text are presented. A semantically-based scheme dramatically improves the information-hiding capacity of any text through two techniques: (i) modifying the granularity of meaning of individual sentences, whereas our own previous scheme kept the granularity fixed, and (ii) halving the number of sentences affected by the watermark. No longer a "long text, short watermark" approach, it now makes it possible to watermark short texts, like wire agency reports. Using both the above-mentioned semantic marking scheme and our previous syntactically-based method hides information in a way that reveals any non-trivial tampering with the text (while re-formatting is not considered to be tampering--the problem would be solved trivially otherwise by hiding a hash of the text) with a probability 1-2-脽 (n+1), n being its number of sentences and 脽 a small positive integer based on the extent of co-referencing.