CT-GAT: Cross-Task Generative Adversarial Attack based on Transferability

Authors:	Lv, Minxuan; Dai, Chengwei; Li, Kun; Zhou, Wei; Hu, Songlin
Publication year:	2023
Subject:	Computer Science - Computation and Language
Document type:	Working Paper
Description:	Neural network models are vulnerable to adversarial examples, and adversarial transferability further increases the risk of adversarial attacks. Current methods based on transferability often rely on substitute models, which can be impractical and costly in real-world scenarios because the training data and the victim model's structural details are unavailable. In this paper, we propose a novel approach that directly constructs adversarial examples by extracting transferable features across various tasks. Our key insight is that adversarial transferability can extend across different tasks. Specifically, we train a sequence-to-sequence generative model named CT-GAT on adversarial sample data collected from multiple tasks, so that it acquires universal adversarial features and generates adversarial examples for different tasks. We conduct experiments on ten distinct datasets, and the results demonstrate that our method achieves superior attack performance at low cost. Comment: Accepted to the EMNLP 2023 main conference. Corrected the header error in Figure 3.
Database:	arXiv
External link:	http://arxiv.org/abs/2310.14265