A convolutional neural network-based linguistic steganalysis for synonym substitution steganography
Autor: | Ling Yun Xiang, Guo Qing Guo, Peng Yang, Victor S. Sheng, Jing Ming Yu |
---|---|
Rok vydání: | 2020 |
Předmět: |
Word embedding
Computer science convolutional neural network 02 engineering and technology Convolutional neural network synonym substitution 0502 economics and business Synonym (database) QA1-939 0202 electrical engineering electronic engineering information engineering steganalysis steganography Dropout (neural networks) Steganalysis Steganography Applied Mathematics 05 social sciences General Medicine word embedding Linguistics Computational Mathematics Modeling and Simulation Softmax function 020201 artificial intelligence & image processing General Agricultural and Biological Sciences TP248.13-248.65 Mathematics 050203 business & management Sentence Biotechnology |
Zdroj: | Mathematical Biosciences and Engineering, Vol 17, Iss 2, Pp 1041-1058 (2020) |
ISSN: | 1551-0018 |
Popis: | In this paper, a linguistic steganalysis method based on two-level cascaded convolutional neural networks (CNNs) is proposed to improve the system's ability to detect stego texts, which are generated via synonym substitutions. The first-level network, sentence-level CNN, consists of one convolutional layer with multiple convolutional kernels in different window sizes, one pooling layer to deal with variable sentence lengths, and one fully connected layer with dropout as well as a softmax output, such that two final steganographic features are obtained for each sentence. The unmodified and modified sentences, along with their words, are represented in the form of pre-trained dense word embeddings, which serve as the input of the network. Sentence-level CNN provides the representation of a sentence, and can thus be utilized to predict whether a sentence is unmodified or has been modified by synonym substitutions. In the second level, a text-level CNN exploits the predicted representations of sentences obtained from the sentence-level CNN to determine whether the detected text is a stego text or cover text. Experimental results indicate that the proposed sentence-level CNN can effectively extract sentence features for sentence-level steganalysis tasks and reaches an average accuracy of 82.245%. Moreover, the proposed steganalysis method achieves greatly improved detection performance when distinguishing stego texts from cover texts. |
Databáze: | OpenAIRE |
Externí odkaz: |