Parallelizing of the DBSCAN algorithm in the ClusterLogs framework
Authors: | Ivan Zherdev, Maria Grigorieva, Konstantin Zhukov, Sergey Korobkov |
---|---|
Year of publication: | 2022 |
Subject: | |
Source: | International Journal of Modern Physics A. 37 |
ISSN: | 1793-656X 0217-751X |
DOI: | 10.1142/s0217751x2150247x |
Description: | ClusterLogs is a framework for automatically categorizing computing jobs and resources by their error messages in distributed computing systems. It was initially developed for high-energy physics experiments but can be applied in other areas. The first prototype was limited to sequential execution and could not process large amounts of data in an acceptable time. The next prototype improved on this significantly by parallelizing several data preprocessing stages. In this paper, we focus on parallelizing the DBSCAN algorithm, the main method used to cluster the numeric vectors that represent the error messages. |
Database: | OpenAIRE |
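
The record above does not say how the paper parallelizes DBSCAN, so the following is only a minimal sketch of one common approach, not the authors' method. It assumes that the expensive step is the neighborhood search: each point's eps-neighborhood is independent of the others, so those queries can be run concurrently, while the cluster expansion stays sequential. The names `region_query`, `dbscan_parallel_neighbors`, and `workers` are hypothetical, and the input points stand in for the numeric vectors that represent error messages.

```python
from concurrent.futures import ThreadPoolExecutor
from math import dist

NOISE = -1
UNVISITED = None


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i] (including i itself)."""
    p = points[i]
    return [j for j, q in enumerate(points) if dist(p, q) <= eps]


def dbscan_parallel_neighbors(points, eps, min_samples, workers=4):
    """Illustrative DBSCAN: concurrent neighborhood queries, sequential expansion."""
    n = len(points)

    # Step 1 (concurrent): each neighborhood query is independent of the others.
    # Threads keep the sketch simple; in CPython a process pool would be needed
    # for a real CPU-bound speedup.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        neighbors = list(pool.map(lambda i: region_query(points, i, eps), range(n)))

    # Step 2 (sequential): grow clusters outward from core points.
    labels = [UNVISITED] * n
    cluster = 0
    for i in range(n):
        if labels[i] is not UNVISITED:
            continue
        if len(neighbors[i]) < min_samples:
            labels[i] = NOISE  # may be relabeled later as a border point
            continue
        labels[i] = cluster
        frontier = list(neighbors[i])
        while frontier:
            j = frontier.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins the cluster, not expanded
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            if len(neighbors[j]) >= min_samples:
                frontier.extend(neighbors[j])  # j is a core point, keep expanding
        cluster += 1
    return labels


if __name__ == "__main__":
    vectors = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
    print(dbscan_parallel_neighbors(vectors, eps=1.5, min_samples=3))
```

Splitting the work this way keeps the cluster labels identical to a sequential run, because the concurrent step only precomputes neighborhoods and never writes labels.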