Scaling up the DBSCAN algorithm for clustering large spatial databases based on sampling technique

Autor: Bian Fuling, He Yanxiang, Zhou Shui-geng, Guan Jihong
Rok vydání: 2001
Předmět:
Zdroj: Wuhan University Journal of Natural Sciences. 6:467-473
ISSN: 1993-4998
1007-1202
DOI: 10.1007/bf03160286
Popis: Clustering, in data mining, is a useful technique for discovering interesting data distributions and patterns in the underlying data, and has many application fields, such as statistical data analysis, pattern recognition, image processing, and etc. We combine sampling technique with DBSCAN algorithm to cluster large spatial databases, and two sampling-based DBSCAN (SDBSCAN) algorithms are developed. One algorithm introduces sampling technique inside DBSCAN, and the other uses sampling procedure outside DBSCAN. Experimental results demonstrate that our algorithms are effective and efficient in clustering largescale spatial databases.
Databáze: OpenAIRE