GANobfuscator: Mitigating Information Leakage Under GAN via Differential Privacy

Autor:	Zhan Qin, Ju Ren, Deyu Zhang, Kui Ren, Chugui Xu, Yaoxue Zhang
Rok vydání:	2019
Předmět:	021110 strategic defence & security studies Training set Computer Networks and Communications Computer science 0211 other engineering and technologies Stability (learning theory) 02 engineering and technology computer.software_genre Data modeling Generative model Information leakage Benchmark (computing) Differential privacy Data mining Safety Risk Reliability and Quality computer
Zdroj:	IEEE Transactions on Information Forensics and Security. 14:2358-2371
ISSN:	1556-6021 1556-6013
Popis:	By learning generative models of semantic-rich data distributions from samples, generative adversarial network (GAN) has recently attracted intensive research interests due to its excellent empirical performance as a generative model. The model is used to estimate the underlying distribution of a dataset and randomly generate realistic samples according to their estimated distribution. However, GANs can easily remember training samples due to the high model complexity of deep networks. When GANs are applied to private or sensitive data, the concentration of distribution may divulge some critical information. It consequently requires new technological advances to mitigate the information leakage under GANs. To address this issue, we propose GANobfuscator, a differentially private GAN, which can achieve differential privacy under GANs by adding carefully designed noise to gradients during the learning procedure. With GANobfuscator, analysts are able to generate an unlimited amount of synthetic data for arbitrary analysis tasks without disclosing the privacy of training data. Moreover, we theoretically prove that GANobfuscator can provide strict privacy guarantee with differential privacy. In addition, we develop a gradient-pruning strategy for GANobfuscator to improve the scalability and stability of data training. Through extensive experimental evaluation on benchmark datasets, we demonstrate that GANobfuscator can produce high-quality generated data and retain desirable utility under practical privacy budgets.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::82c40d215f111d39a743c3a6748c7e63 https://doi.org/10.1109/tifs.2019.2897874 Zobrazit plný text záznamu