Scalable density-based clustering with quality guarantees using random projections

Autor: Johannes Schneider, Michail Vlachos
Rok vydání: 2017
Předmět:
Zdroj: Data Mining and Knowledge Discovery. 31:972-1005
ISSN: 1573-756X
1384-5810
Popis: Clustering offers significant insights in data analysis. Density-based algorithms have emerged as flexible and efficient techniques, able to discover high-quality and potentially irregularly shaped clusters. Here, we present scalable density-based clustering algorithms using random projections. Our clustering methodology achieves a speedup of two orders of magnitude compared with equivalent state-of-art density-based techniques, while offering analytical guarantees on the clustering quality in Euclidean space. Moreover, it does not introduce difficult to set parameters. We provide a comprehensive analysis of our algorithms and comparison with existing density-based algorithms.
Databáze: OpenAIRE