Set-valued data collection with local differential privacy based on category hierarchy
Autor: | Xiao Zhenghong, Yinyin Xiao, Liu Shaopeng, Xiuxiu Liao, Ouyang Jia |
---|---|
Rok vydání: | 2021 |
Předmět: |
data collection
Computer science 02 engineering and technology computer.software_genre set-valued data Set (abstract data type) 0502 economics and business QA1-939 0202 electrical engineering electronic engineering information engineering Differential privacy Private information retrieval Hierarchy Data collection business.industry Applied Mathematics 05 social sciences utility function General Medicine Function (mathematics) local differential privacy privacy preservation Computational Mathematics Core (game theory) Modeling and Simulation 020201 artificial intelligence & image processing Data center Data mining General Agricultural and Biological Sciences business computer TP248.13-248.65 Mathematics 050203 business & management Biotechnology |
Zdroj: | Mathematical Biosciences and Engineering, Vol 18, Iss 3, Pp 2733-2763 (2021) |
ISSN: | 1551-0018 |
DOI: | 10.3934/mbe.2021139 |
Popis: | Set-valued data is extremely important and widely used in sensor technology and application. Recently, privacy protection for set-valued data under differential privacy (DP) has become a research hotspot. However, the DP model assumes that the data center is trustworthy, consequently, increasingly attention has been paid to the application of the local differential privacy model (LDP) for set-valued data. Constrained by the local differential privacy model, most methods randomly respond to the subset of set-valued data, and the data collector conducts statistics on the received data. There are two main problems with this kind of method: one is that the utility function used in the random response loses too much information; the other is that the privacy protection of the set-valued data category is usually ignored. To solve these problems, this paper proposes a set-valued data collection method (SetLDP) based on the category hierarchy under the local differential privacy model. The core idea is to first make a random response to the existence of the category, continue to disturb the item count if the category exists, and finally randomly respond to a candidate itemset based on the new utility function. Theory analysis and experimental results show that the SetLDP can not only preserve more information, but also protect the category private information in set-valued data. |
Databáze: | OpenAIRE |
Externí odkaz: |