Autor:	Shahrezaei, Maliheh Heydarpour, Tavoli, Reza
Rok vydání:	2019
Předmět:	Computer Science - Distributed Parallel and Cluster Computing
Druh dokumentu:	Working Paper
Popis:	K-means++ is an algorithm which is invented to improve the process of finding initial seeds in K-means algorithm. In this algorithm, initial seeds are chosen consecutively by a probability which is proportional to the distance to the nearest center. The most crucial problem of this algorithm is that when running in serial mode, it decreases the speed of clustering. In this paper, we aim to parallelize the most time consuming steps of the k-means++ algorithm. Our purpose is to reduce the running time while maintaining the quality of the serial algorithm.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/1908.02136 Zobrazit plný text záznamu View this record from Arxiv