Privacy Preserving Data Mining for Ordinal Data using Correlation Based Transformation Strategy (CBTS)

Autor: P. Madhura, K. R. Venugopal, S. Geethanjali, M. Indiramma, Prasanth G. Rao, K. Neha Nandan, P. Deepa Shenoy, Chaitra C. Vaidya, N. P. Nethravathi
Rok vydání: 2016
Předmět:
Zdroj: Indian Journal of Science and Technology. 9
ISSN: 0974-5645
0974-6846
DOI: 10.17485/ijst/2016/v9i47/107360
Popis: Objectives: Preservation of privacy is a significant aspect of data mining. The main objective of PPDM is to hide or provide privacy to certain sensitive information so that they can be protected from unauthorized parties or intruders. Methods/ Statistical Analysis: Though privacy is achieved by hiding the sensitive or private data, it will affect the data mining algorithms in knowledge extraction, so an effective method or strategy is required to provide privacy to the data and simultaneously protecting the quality of data mining algorithms. Instead of removing or encrypting sensitive or private data, we make use of data transformation strategies that keep the statistical, semantic and heuristic nature of data while protecting the sensitive or private data. Findings: In this paper we studied the technical feasibility of realizing Privacy Preserving Data Mining. In the proposed work, Correlation Based Transformation Strategy for Privacy Preserving Data Mining is used for ordinal data. We apply the method on few datasets namely soybean, Breast Cancer, Nursery dataset and Car dataset. We tabulate the end results applying the proposed strategy on both the original and the transformed dataset and observe correlation difference, Information Entropy and Classification Accuracy with different machine learning algorithms and Clustering Quality. Application/Improvements: As an improvement, the proposed work can be extended by use of vector marking techniques where these techniques help in increasing the efficiency by avoiding unauthorised access to the information.
Databáze: OpenAIRE