Data sanitization against label flipping attacks using AdaBoost-based semi-supervised learning technology
Autor: | Zhanbo Li, Ning Cheng, Hongpo Zhang |
---|---|
Rok vydání: | 2021 |
Předmět: |
Computer science
business.industry Computational intelligence Pattern recognition Semi-supervised learning Theoretical Computer Science Support vector machine ComputingMethodologies_PATTERNRECOGNITION Robustness (computer science) Classifier (linguistics) Entropy (information theory) Geometry and Topology Artificial intelligence Noise (video) AdaBoost business Software |
Zdroj: | Soft Computing. 25:14573-14581 |
ISSN: | 1433-7479 1432-7643 |
DOI: | 10.1007/s00500-021-06384-y |
Popis: | The label flipping attack is a special poisoning attack in the adversarial environment. The research designed a novel label noise processing framework, the core of which is the semi-supervised learning label correction algorithm based on AdaBoost (AdaSSL). It can effectively improve the label quality of training data and improve the classification performance of the model. Based on five real UCI datasets, this study chose six classic machine learning algorithms (NB, LR, SVM, DT, KNN and MLP) as the base classifiers to classify them. With a noise level of 0 $$ \sim $$ 20%, we evaluated the classification effect of these classifiers on UCI datasets based on the entropy label flipping attack and the AdaSSL defense algorithm. The experimental results show that the AdaSSL algorithm can effectively improve the robustness of the classifier against label flipping attack. Compared with the most advanced semi-supervised defense algorithm in the literature, the algorithm does not need to use additional datasets. At a noise ratio of 10%, the AdaSSL algorithm is significantly better than state-of-the-art label noise defense technology. |
Databáze: | OpenAIRE |
Externí odkaz: |