ActiveThief: Model Extraction Using Active Learning and Unannotated Public Data

Autor:	Aditya Kanade, Yash Gupta, Shirish Shevade, Vinod Ganapathy, Soham Pal, Aditya Shukla
Rok vydání:	2020
Předmět:	Model extraction Service (systems architecture) Application programming interface Computer science Active learning (machine learning) business.industry General Medicine Machine learning computer.software_genre Image (mathematics) Variety (cybernetics) Active learning Domain knowledge Artificial intelligence business computer
Zdroj:	AAAI
ISSN:	2374-3468 2159-5399
Popis:	Machine learning models are increasingly being deployed in practice. Machine Learning as a Service (MLaaS) providers expose such models to queries by third-party developers through application programming interfaces (APIs). Prior work has developed model extraction attacks, in which an attacker extracts an approximation of an MLaaS model by making black-box queries to it. We design ActiveThief – a model extraction framework for deep neural networks that makes use of active learning techniques and unannotated public datasets to perform model extraction. It does not expect strong domain knowledge or access to annotated data on the part of the attacker. We demonstrate that (1) it is possible to use ActiveThief to extract deep classifiers trained on a variety of datasets from image and text domains, while querying the model with as few as 10-30% of samples from public datasets, (2) the resulting model exhibits a higher transferability success rate of adversarial examples than prior work, and (3) the attack evades detection by the state-of-the-art model extraction detection method, PRADA.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::141e8c5509fae75e8d823f2430becb88 https://doi.org/10.1609/aaai.v34i01.5432 Zobrazit plný text záznamu