A Collaborative Framework for Privacy Preserving Fuzzy Co-Clustering of Vertically Distributed Cooccurrence Matrices
Autor: | Akira Notsu, Daiji Tanaka, Katsuhiro Honda, Toshiya Oda |
---|---|
Rok vydání: | 2015 |
Předmět: |
lcsh:Computer software
Structure (mathematical logic) Control and Optimization Fuzzy clustering Article Subject Process (engineering) Computer science Object (computer science) computer.software_genre Fuzzy logic Biclustering Computational Mathematics lcsh:QA76.75-76.765 Control and Systems Engineering lcsh:Electrical engineering. Electronics. Nuclear engineering Data mining Cluster analysis lcsh:TK1-9971 computer Categorical variable |
Zdroj: | Advances in Fuzzy Systems, Vol 2015 (2015) |
ISSN: | 1687-711X 1687-7101 |
DOI: | 10.1155/2015/729072 |
Popis: | In many real world data analysis tasks, it is expected that we can get much more useful knowledge by utilizing multiple databases stored in different organizations, such as cooperation groups, state organs, and allied countries. However, in many such organizations, they often hesitate to publish their databases because of privacy and security issues although they believe the advantages of collaborative analysis. This paper proposes a novel collaborative framework for utilizing vertically partitioned cooccurrence matrices in fuzzy co-cluster structure estimation, in which cooccurrence information among objects and items is separately stored in several sites. In order to utilize such distributed data sets without fear of information leaks, a privacy preserving procedure is introduced to fuzzy clustering for categorical multivariate data (FCCM). Withholding each element of cooccurrence matrices, only object memberships are shared by multiple sites and their (implicit) joint co-cluster structures are revealed through an iterative clustering process. Several experimental results demonstrate that collaborative analysis can contribute to revealing global intrinsic co-cluster structures of separate matrices rather than individual site-wise analysis. The novel framework makes it possible for many private and public organizations to share common data structural knowledge without fear of information leaks. |
Databáze: | OpenAIRE |
Externí odkaz: |