A Statistical Algorithm for Linguistic Steganography Detection Based on Distribution of Words

Authors:
Chen Zhi-li;Huang Liu-sheng;Yu Zhen-shan;Li Ling-jun;Yang Wei
Affiliations:
-;-;-;-;-
Venue:
ARES '08 Proceedings of the 2008 Third International Conference on Availability, Reliability and Security
Year:
2008

Citing 0
Cited 3

STBS: a statistical algorithm for steganalysis of translation-based steganography

IH'10 Proceedings of the 12th international conference on Information hiding
Blind linguistic steganalysis against translation based steganography

IWDW'10 Proceedings of the 9th international conference on Digital watermarking
LinL: lost in n-best list

IH'11 Proceedings of the 13th international conference on Information hiding

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a novel statistical algorithm for linguistic steganography detection, which takes advantage of distribution of words in the text segment detected, is presented. Linguistic steganography is the art of using written natural language to hide the very presence of secret messages. Using the text data, which is the foundational media in internet communications, as its carrier, linguistic steganography plays an important part in Information Hiding (IH) area. The previous work was mainly focused on linguistic steganography and there were few researches on linguistic steganalisys. We attempt to do something to help to fix this gap. In our experiment of detecting the three different linguistic steganography methods: NICETEXT,TEXTO and Markov-Chain-Based, the total accuracies on discovering stego-text segments and normal text segmentsare found to be 87.39%, 95.51%, 98.50%, 99.15% and 99.57% respectively when the segment size is 5kB, 10kB,20kB, 30kB and 40kB. Our research shows that the linguistic steganalysis based on distribution of words is promising.