Linguistic steganography detection using statistical characteristics of correlations between words

  • Zhili Chen*
  • , Liusheng Huang
  • , Zhenshan Yu
  • , Wei Yang
  • , Lingjun Li
  • , Xueling Zheng
  • , Xinxin Zhao
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

30 Scopus citations

Abstract

Linguistic steganography is a branch of Information Hiding (IH) using written natural language to conceal secret messages. It plays an important role in Information Security (IS) area. Previous work on linguistic steganography was mainly focused on steganography and there were few researches on attacks against it. In this paper, a novel statistical algorithm for linguistic steganography detection is presented. We use the statistical characteristics of correlations between the general service words gathered in a dictionary to classify the given text segments into stego-text segments and normal text segments. In the experiment of blindly detecting the three different linguistic steganography approaches: Markov-Chain-Based, NICETEXT and TEXTO, the total accuracy of discovering stego-text segments and normal text segments is found to be 97.19%. Our results show that the linguistic steganalysis based on correlations between words is promising.

Original languageEnglish
Title of host publicationInformation Hiding - 10th International Workshop, IH 2008, Revised Selected Papers
PublisherSpringer Verlag
Pages224-235
Number of pages12
ISBN (Print)3540889604, 9783540889601
DOIs
StatePublished - 2008
Externally publishedYes
Event10th International Workshop on Information Hiding, IH 2008 - Santa Barbara, CA, United States
Duration: 19 May 200821 May 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5284 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Workshop on Information Hiding, IH 2008
Country/TerritoryUnited States
CitySanta Barbara, CA
Period19/05/0821/05/08

Fingerprint

Dive into the research topics of 'Linguistic steganography detection using statistical characteristics of correlations between words'. Together they form a unique fingerprint.

Cite this