Detection of synonym-substitution modified articles using context information

  • Zhenshan Yu*
  • , Liusheng Huang
  • , Zhili Chen
  • , Lingjun Li
  • , Xinxin Zhao
  • , Youwen Zhu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

Text steganography usually modifies the cover-text (where secrets are embedded) in some meaning-preserving ways to conceal secret messages, while steganalysis does the opposite - detects or extracts the secrets. A lot of work has been done on steganography, but only a little on steganalysis. In this paper, we analyze one kind of text steganography that use synonym substitution. We try to distinguish between modified articles and unmodified articles using context information. We evaluate the suitability of words for their context, and then the suitability sequence of words leads to the final judgment made by a SVM (support vector machine) classifier. IDF (inverse document frequency) is used to weight words' suitability in order to balance common words and rare ones. This scheme is evaluated on internet instead of in a specific corpus, with the help of Google. Experimental results show that classification accuracy achieves 90.0%.

Original languageEnglish
Title of host publicationProceedings of the 2008 2nd International Conference on Future Generation Communication and Networking, FGCN 2008
Pages134-139
Number of pages6
DOIs
StatePublished - 2008
Externally publishedYes
Event2008 2nd International Conference on Future Generation Communication and Networking, FGCN 2008 - Hainan Island, China
Duration: 13 Dec 200815 Dec 2008

Publication series

NameProceedings of the 2008 2nd International Conference on Future Generation Communication and Networking, FGCN 2008
Volume1

Conference

Conference2008 2nd International Conference on Future Generation Communication and Networking, FGCN 2008
Country/TerritoryChina
CityHainan Island
Period13/12/0815/12/08

Fingerprint

Dive into the research topics of 'Detection of synonym-substitution modified articles using context information'. Together they form a unique fingerprint.

Cite this