A Study of Adaptive Map/Reduce Affinity Propagation Algorithm

Autor: Yuan-Cheng Liu, 劉原呈
Rok vydání: 2014
Druh dokumentu: 學位論文 ; thesis
Popis: 102
The Affinity Propagation (AP) is a clustering algorithm based on the concept of “message passing” between data points. Unlike most clustering algorithms such as k-means, the AP does not require the number of clusters to be determined or estimated before running the algorithm. There are implementation of AP on Hadoop, a distribute cloud environment, called the Map/Reduce Affinity Propagation (MRAP). But the MRAP has a limitation: it is hard to know what value of parameter “preference” can yield an optimal clustering solution. The Adaptive Affinity Propagation Clustering (AAP) algorithm was proposed to overcome this limitation to decide the preference value in AP. In this study, we propose to combine these two methods as the Adaptive Map/Reduce Affinity Propagation (AMRAP), which divides the clustering task to multiple mappers and one reducer in Hadoop, and decides suitable preference values individually for each mapper. In the experiments, we compare the clustering results of the proposed AMRAP with the original MRAP method. The experiment results support that the proposed AMRAP method outperforms the original MRAP method in terms of accuracy, Davies–Bouldin index and Dunn Index. In the experiments, we compare the clustering result of the proposed AMRAP with the MRAP method. The experiment results support that the proposed AMRAP method has good performance at accuracy, Davies–Bouldin index and Dunn Index.
Databáze: Networked Digital Library of Theses & Dissertations