Mole Recruitment: Poisoning of Image Classifiers via Selective Batch Sampling

Autor:	Wisdom, Ethan, Gokhale, Tejas, Xiao, Chaowei, Yang, Yezhou
Rok vydání:	2023
Předmět:	Computer Science - Machine Learning
Druh dokumentu:	Working Paper
Popis:	In this work, we present a data poisoning attack that confounds machine learning models without any manipulation of the image or label. This is achieved by simply leveraging the most confounding natural samples found within the training data itself, in a new form of a targeted attack coined "Mole Recruitment." We define moles as the training samples of a class that appear most similar to samples of another class, and show that simply restructuring training batches with an optimal number of moles can lead to significant degradation in the performance of the targeted class. We show the efficacy of this novel attack in an offline setting across several standard image classification datasets, and demonstrate the real-world viability of this attack in a continual learning (CL) setting. Our analysis reveals that state-of-the-art models are susceptible to Mole Recruitment, thereby exposing a previously undetected vulnerability of image classifiers.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2303.17080 Zobrazit plný text záznamu View this record from Arxiv