Parallelizing of the DBSCAN algorithm in the ClusterLogs framework

Authors: Ivan Zherdev, Maria Grigorieva, Konstantin Zhukov, Sergey Korobkov
Publication year: 2022
Subject:
Source: International Journal of Modern Physics A. 37
ISSN: 1793-656X, 0217-751X
DOI: 10.1142/s0217751x2150247x
Description: ClusterLogs is a framework that automatically categorizes computing jobs and resources by their error messages in distributed computing systems. It was initially developed for high-energy physics experiments but can be applied in other areas. The first prototype of the framework was limited to sequential execution and could not process large amounts of data in an acceptable time. The next prototype improved on this significantly by parallelizing several data preprocessing stages. In this paper, we focus on parallelizing the DBSCAN algorithm, the main method used to cluster the numeric vectors that represent the error messages.
Database: OpenAIRE