Finding subspace clusters using ranked neighborhoods
Autor: | Siegfried Nijssen, Matthijs van Leeuwen, Emin Aksehirli, Bart Goethals |
---|---|
Přispěvatelé: | Cui, P, Dy, J, Aggarwal, C, Zhou, ZH, Tuzhilin, A, Xiong, H, Wu, X |
Jazyk: | angličtina |
Rok vydání: | 2015 |
Předmět: |
Computer. Automation
Clustering high-dimensional data Fuzzy clustering business.industry Computer science Correlation clustering Single-linkage clustering Pattern recognition computer.software_genre ComputingMethodologies_PATTERNRECOGNITION Ranking CURE data clustering algorithm Canopy clustering algorithm FLAME clustering Artificial intelligence Data mining Cluster analysis business computer Subspace topology k-medians clustering Curse of dimensionality |
Zdroj: | ICDM Workshops IEEE 15th International Conference on Data Mining Workshops (ICDMW), NOV 14-17, 2015, ATlantic city, NJ 2015 IEEE International Conference on Data Mining Workshop (ICDMW) |
Popis: | Clustering high dimensional datasets is challenging due to the curse of dimensionality. One approach to address this challenge is to search for subspace clusters, i.e., clusters present in subsets of attributes. Recently the cartification algorithm was proposed to find such subspace clusters. The distinguishing feature of this algorithm is that it operates on a neighborhood database, in which for every object only the identities of the k closest objects are stored. Cartification was shown to produce better results than other state-of-the-art subspace clustering algorithms; however, which clusters it detects was also found to depend heavily on the setting of the parameters. In other words, it is not robust to input parameters. In this paper, we propose a new approach called ranked cartification that produces more robust results than ordinary cartification. We develop a transformation that creates ranked matrices instead of neighborhood databases; we identify clusters in these ranked matrices. We demonstrate that this method is more robust than cartification in terms of cluster detection. |
Databáze: | OpenAIRE |
Externí odkaz: |