A Hybrid Multi-group Privacy-Preserving Approach for Building Decision Trees.

Autor: Carbonell, Jaime G., Siekmann, Jörg, Zhi-Hua Zhou, Hang Li, Qiang Yang, Zhouxuan Teng, Wenliang Du
Zdroj: Advances in Knowledge Discovery & Data Mining; 2007, p296-307, 12p
Abstrakt: In this paper, we study the privacy-preserving decision tree building problem on vertically partitioned data. We made two contributions. First, we propose a novel hybrid approach, which takes advantage of the strength of the two existing approaches, randomization and the secure multi-party computation (SMC), to balance the accuracy and efficiency constraints. Compared to these two existing approaches, our proposed approach can achieve much better accuracy than randomization approach and much reduced computation cost than SMC approach. We also propose a multi-group scheme that makes it flexible for data miners to control the balance between data mining accuracy and privacy. We partition attributes into groups, and develop a scheme to conduct group-based randomization to achieve better data mining accuracy. We have implemented and evaluated the proposed schemes for the ID3 decision tree algorithm. [ABSTRACT FROM AUTHOR]
Databáze: Supplemental Index