Data sanitization against label flipping attacks using AdaBoost-based semi-supervised learning technology

Autor:	Zhanbo Li, Ning Cheng, Hongpo Zhang
Rok vydání:	2021
Předmět:	Computer science business.industry Computational intelligence Pattern recognition Semi-supervised learning Theoretical Computer Science Support vector machine ComputingMethodologies_PATTERNRECOGNITION Robustness (computer science) Classifier (linguistics) Entropy (information theory) Geometry and Topology Artificial intelligence Noise (video) AdaBoost business Software
Zdroj:	Soft Computing. 25:14573-14581
ISSN:	1433-7479 1432-7643
DOI:	10.1007/s00500-021-06384-y
Popis:	The label flipping attack is a special poisoning attack in the adversarial environment. The research designed a novel label noise processing framework, the core of which is the semi-supervised learning label correction algorithm based on AdaBoost (AdaSSL). It can effectively improve the label quality of training data and improve the classification performance of the model. Based on five real UCI datasets, this study chose six classic machine learning algorithms (NB, LR, SVM, DT, KNN and MLP) as the base classifiers to classify them. With a noise level of 0 $$ \sim $$ 20%, we evaluated the classification effect of these classifiers on UCI datasets based on the entropy label flipping attack and the AdaSSL defense algorithm. The experimental results show that the AdaSSL algorithm can effectively improve the robustness of the classifier against label flipping attack. Compared with the most advanced semi-supervised defense algorithm in the literature, the algorithm does not need to use additional datasets. At a noise ratio of 10%, the AdaSSL algorithm is significantly better than state-of-the-art label noise defense technology.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::761982bdeef11040af36293449d0e4c5 https://doi.org/10.1007/s00500-021-06384-y Zobrazit plný text záznamu Full text from SpringerLink