A Statistical Algorithm for Linguistic Steganography Detection Based on Distribution of Words

  • Authors:
  • Chen Zhi-li;Huang Liu-sheng;Yu Zhen-shan;Li Ling-jun;Yang Wei

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • ARES '08 Proceedings of the 2008 Third International Conference on Availability, Reliability and Security
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a novel statistical algorithm for linguistic steganography detection, which takes advantage of distribution of words in the text segment detected, is presented. Linguistic steganography is the art of using written natural language to hide the very presence of secret messages. Using the text data, which is the foundational media in internet communications, as its carrier, linguistic steganography plays an important part in Information Hiding (IH) area. The previous work was mainly focused on linguistic steganography and there were few researches on linguistic steganalisys. We attempt to do something to help to fix this gap. In our experiment of detecting the three different linguistic steganography methods: NICETEXT,TEXTO and Markov-Chain-Based, the total accuracies on discovering stego-text segments and normal text segmentsare found to be 87.39%, 95.51%, 98.50%, 99.15% and 99.57% respectively when the segment size is 5kB, 10kB,20kB, 30kB and 40kB. Our research shows that the linguistic steganalysis based on distribution of words is promising.