Cryptologia
Practical Invisibility in Digital Communication
Proceedings of the First International Workshop on Information Hiding
Natural Language Watermarking: Design, Analysis, and a Proof-of-Concept Implementation
IHW '01 Proceedings of the 4th International Workshop on Information Hiding
Natural Language Watermarking and Tamperproofing
IH '02 Revised Papers from the 5th International Workshop on Information Hiding
Semantic forensics: an application of ontological semantics to information assurance
TextMean '04 Proceedings of the 2nd Workshop on Text Meaning and Interpretation
Linguistic steganography using automatically generated paraphrases
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Hi-index | 0.00 |
This paper presents a meaning-based method to distinguish text without or with little semantic content from text that has meaning which can be processed. The basic method assumes that a semantic analyzer will be able to produce less output from semantically less grammatical input text. The method was pilot-tested on a corpus of blog spam. Future improvements, including a method to distinguish semantically unified from semantically disparate text are sketched. The tested method, but even more the projected improvements, open up the way to taking the spam filtering arms race to a new level that is very costly to spam producers.