Pairwise Learning with Differential Privacy Guarantees
Autor: | Aidong Zhang, Di Wang, Chenglin Miao, Jinhui Xu, Mengdi Huai |
---|---|
Rok vydání: | 2020 |
Předmět: |
Online and offline
Training set business.industry Computer science 02 engineering and technology General Medicine Maximization Machine learning computer.software_genre 020204 information systems Metric (mathematics) 0202 electrical engineering electronic engineering information engineering Differential privacy 020201 artificial intelligence & image processing Artificial intelligence business Convex function computer |
Zdroj: | AAAI |
ISSN: | 2374-3468 2159-5399 |
DOI: | 10.1609/aaai.v34i01.5411 |
Popis: | Pairwise learning has received much attention recently as it is more capable of modeling the relative relationship between pairs of samples. Many machine learning tasks can be categorized as pairwise learning, such as AUC maximization and metric learning. Existing techniques for pairwise learning all fail to take into consideration a critical issue in their design, i.e., the protection of sensitive information in the training set. Models learned by such algorithms can implicitly memorize the details of sensitive information, which offers opportunity for malicious parties to infer it from the learned models. To address this challenging issue, in this paper, we propose several differentially private pairwise learning algorithms for both online and offline settings. Specifically, for the online setting, we first introduce a differentially private algorithm (called OnPairStrC) for strongly convex loss functions. Then, we extend this algorithm to general convex loss functions and give another differentially private algorithm (called OnPairC). For the offline setting, we also present two differentially private algorithms (called OffPairStrC and OffPairC) for strongly and general convex loss functions, respectively. These proposed algorithms can not only learn the model effectively from the data but also provide strong privacy protection guarantee for sensitive information in the training set. Extensive experiments on real-world datasets are conducted to evaluate the proposed algorithms and the experimental results support our theoretical analysis. |
Databáze: | OpenAIRE |
Externí odkaz: |