A General Clustering Agreement Index: For Comparing Disjoint and Overlapping Clusters

Autor: Reihaneh Rabbany, Osmar Zaïane
Rok vydání: 2017
Předmět:
Zdroj: Proceedings of the AAAI Conference on Artificial Intelligence. 31
ISSN: 2374-3468
2159-5399
DOI: 10.1609/aaai.v31i1.10905
Popis: A clustering agreement index quantifies the similarity between two given clusterings. It is most commonly used to compare the results obtained from different clustering algorithms against the ground-truth clustering in the benchmark datasets. In this paper, we present a general Clustering Agreement Index (CAI) for comparing disjoint and overlapping clusterings. CAI is generic and introduces a family of clustering agreement indexes. In particular, the two widely used indexes of Adjusted Rand Index (ARI), and Normalized Mutual Information (NMI), are special cases of the CAI. Our index, therefore, provides overlapping extensions for both these commonly used indexes, whereas their original formulations are only defined for disjoint cases. Lastly, unlike previous indexes, CAI is flexible and can be adapted to incorporate the structure of the data, which is important when comparing clusters in networks, a.k.a communities.
Databáze: OpenAIRE