Pairwise Learning with Differential Privacy Guarantees

Autor:	Aidong Zhang, Di Wang, Chenglin Miao, Jinhui Xu, Mengdi Huai
Rok vydání:	2020
Předmět:	Online and offline Training set business.industry Computer science 02 engineering and technology General Medicine Maximization Machine learning computer.software_genre 020204 information systems Metric (mathematics) 0202 electrical engineering electronic engineering information engineering Differential privacy 020201 artificial intelligence & image processing Artificial intelligence business Convex function computer
Zdroj:	AAAI
ISSN:	2374-3468 2159-5399
DOI:	10.1609/aaai.v34i01.5411
Popis:	Pairwise learning has received much attention recently as it is more capable of modeling the relative relationship between pairs of samples. Many machine learning tasks can be categorized as pairwise learning, such as AUC maximization and metric learning. Existing techniques for pairwise learning all fail to take into consideration a critical issue in their design, i.e., the protection of sensitive information in the training set. Models learned by such algorithms can implicitly memorize the details of sensitive information, which offers opportunity for malicious parties to infer it from the learned models. To address this challenging issue, in this paper, we propose several differentially private pairwise learning algorithms for both online and offline settings. Specifically, for the online setting, we first introduce a differentially private algorithm (called OnPairStrC) for strongly convex loss functions. Then, we extend this algorithm to general convex loss functions and give another differentially private algorithm (called OnPairC). For the offline setting, we also present two differentially private algorithms (called OffPairStrC and OffPairC) for strongly and general convex loss functions, respectively. These proposed algorithms can not only learn the model effectively from the data but also provide strong privacy protection guarantee for sensitive information in the training set. Extensive experiments on real-world datasets are conducted to evaluate the proposed algorithms and the experimental results support our theoretical analysis.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::aad0493da7f8d4760934983fe7d3546c https://doi.org/10.1609/aaai.v34i01.5411 Zobrazit plný text záznamu