An efficient perturbation approach for multivariate data in sensitive and reliable data mining
Author: | Mahit Kumar Paul, A.H.M. Sarowar Sattar, Md. Rabiul Islam |
---|---|
Year of publication: | 2021 |
Subject: | Normalization (statistics); Computer Networks and Communications; Computer science; Scale (descriptive set theory); Scalar multiplication; Data set; Information sensitivity; Linear regression; Benchmark (computing); Data mining; Safety, Risk, Reliability and Quality; Rotation (mathematics); Software |
Source: | Journal of Information Security and Applications. 62:102954 |
ISSN: | 2214-2126 |
DOI: | 10.1016/j.jisa.2021.102954 |
Description: | With the rapid advance of technology, the volume of cloud data is growing quickly, and much of it contains individuals' sensitive information, such as medical diagnostic reports. When knowledge is extracted from such sensitive data, both the privacy of individuals and the utility of the data should be preserved, which is a crucial concern in data mining. Although several privacy-preserving methods exist, no single method balances privacy and data utility: achieving privacy often costs data utility, and vice versa. To address this issue, this work proposes a four-stage data perturbation approach for sensitive data mining, called NRoReM, based on normalization, geometric rotation, linear regression, and scalar multiplication. The approach is evaluated on ten UCI data sets using three benchmark classifiers. The empirical analysis of privacy protection, attack resistance, information entropy, data utility, and error shows that, for 90% of the data sets, NRoReM preserves both individual privacy and data utility to a greater extent than 3-Dimensional Rotation Transformation (3DRT) and 2-Dimensional Rotation Transformation (2DRT). |
Database: | OpenAIRE |
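The description names normalization, geometric rotation, and scalar multiplication among the stages of NRoReM but does not give their details. The sketch below illustrates only these three generic building blocks on a toy table; it is not the authors' algorithm. The function names (`min_max_normalize`, `rotate_pair`, `perturb`), the choice of min-max scaling, the rotation of only the first two attributes, and the values of `theta` and `scale` are all assumptions made for illustration. The linear regression stage is omitted because the record does not say how it is used.

```python
import math


def min_max_normalize(rows):
    """Scale each column of a list-of-rows table to the range [0, 1]."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def rotate_pair(x, y, theta):
    """Rotate the point (x, y) counter-clockwise by theta radians."""
    c, s = math.cos(theta), math.sin(theta)
    return x * c - y * s, x * s + y * c


def perturb(rows, theta, scale):
    """Normalize, rotate the first two attributes, then scale every value.

    Hypothetical pipeline for illustration only; not the NRoReM procedure.
    """
    out = []
    for row in min_max_normalize(rows):
        x, y = rotate_pair(row[0], row[1], theta)
        out.append([scale * v for v in [x, y, *row[2:]]])
    return out


# Toy records with two numeric attributes (made-up values).
records = [[1.0, 10.0], [2.0, 20.0], [3.0, 40.0]]
perturbed = perturb(records, theta=math.pi / 4, scale=2.0)
```

Rotation preserves the distances between records and a uniform scalar multiplies them all by the same factor, which is the usual reason such transformations are expected to leave distance-based classifiers largely unaffected while hiding the original values.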