DunDi: Improving Robustness of Neural Networks Using Distance Metric Learning
Autor: | Rong-rong Xi, Lei Cui, Xuehao Yu, Lei Zhang, Zhiyu Hao |
---|---|
Rok vydání: | 2019 |
Předmět: | |
Zdroj: | Lecture Notes in Computer Science ISBN: 9783030227401 ICCS (2) |
Popis: | The deep neural networks (DNNs), although highly accurate, are vulnerable to adversarial attacks. A slight perturbation applied to a sample may lead to misprediction of the DNN, even it is imperceptible to humans. This defect makes the DNN lack of robustness to malicious perturbations, and thus limits their usage in many safety-critical systems. To this end, we present DunDi, a metric learning based classification model, to provide the ability to defend adversarial attacks. The key idea behind DunDi is a metric learning model which is able to pull samples of the same label together meanwhile pushing samples of different labels away. Consequently, the distance between samples and model’s boundary can be enlarged accordingly, so that significant perturbations are required to fool the model. Then, based on the distance comparison, we propose a two-step classification algorithm that performs efficiently for multi-class classification. DunDi can not only build and train a new customized model but also support the incorporation of the available pre-trained neural network models to take full advantage of their capabilities. The results show that DunDi is able to defend 94.39% and 88.91% of adversarial samples generated by four state-of-the-art adversarial attacks on the MNIST dataset and CIFAR-10 dataset, without hurting classification accuracy. |
Databáze: | OpenAIRE |
Externí odkaz: |