Instability of Hierarchical Cluster Analysis Due to Input Order of the Data: The PermuCLUSTER Solution

Authors: Alexander M. J. Spaans, Willem A. van der Kloot, Willem J. Heiser
Year of publication: 2005
Subject:
Source: Psychological Methods. 10:468-476
ISSN: 1939-1463, 1082-989X
DOI: 10.1037/1082-989x.10.4.468
Description: Hierarchical agglomerative cluster analysis (HACA) may yield different solutions under permutations of the input order of the data. This instability is caused by ties, either in the initial proximity matrix or arising during agglomeration. The authors recommend repeating the analysis on a large number of random permutations of the rows and columns of the proximity matrix and selecting a solution with the highest goodness of fit. This approach was implemented in an SPSS add-in, PermuCLUSTER, which can perform all HACA methods of SPSS. Analyses of 2 data sets show that (a) results are affected by input order, (b) instability in one method co-occurs with instability in other methods, and (c) some instability effects are more dramatic because they occur at higher agglomeration levels.
Database: OpenAIRE
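
Illustration: the permutation procedure summarized in the description can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the PermuCLUSTER implementation: it assumes complete linkage as the HACA method, assumes the cophenetic correlation as the goodness-of-fit measure (the record does not say which measure the authors use), and breaks ties by taking the first minimal pair in scan order. The function names (`complete_linkage`, `cophenetic_fit`, `permuted_haca`) and the toy matrix `D` are hypothetical. Permuting the visiting order of the objects plays the role of permuting rows and columns of the proximity matrix together.

```python
import math
import random


def complete_linkage(d, order):
    """Cluster objects visited in `order`; ties go to the first pair scanned.

    Returns cophenetic distances as a dict keyed by frozenset({i, j}).
    """
    clusters = [[i] for i in order]  # members keep their original labels
    coph = {}
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Complete linkage: largest dissimilarity between members.
                h = max(d[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or h < best[0]:  # strict '<' keeps the first tie
                    best = (h, a, b)
        h, a, b = best
        for i in clusters[a]:
            for j in clusters[b]:
                coph[frozenset((i, j))] = h  # height at which i and j join
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return coph


def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = math.sqrt(sxx * syy)
    return sxy / denom if denom > 0 else 0.0


def cophenetic_fit(d, coph):
    """Assumed goodness of fit: correlation of input and cophenetic distances."""
    n = len(d)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    return pearson([d[i][j] for i, j in pairs],
                   [coph[frozenset(p)] for p in pairs])


def permuted_haca(d, n_perm=200, seed=0):
    """Repeat the analysis over random input orders and keep the best fit."""
    rng = random.Random(seed)
    order = list(range(len(d)))
    best_fit, best_coph, fits = -math.inf, None, set()
    for _ in range(n_perm):
        rng.shuffle(order)
        coph = complete_linkage(d, order)
        fit = cophenetic_fit(d, coph)
        fits.add(round(fit, 9))  # distinct fit values seen across orders
        if fit > best_fit:
            best_fit, best_coph = fit, coph
    return best_fit, best_coph, fits


# Toy dissimilarities for points at 0, 1, 2, 4 on a line: d(0,1) and d(1,2) tie.
D = [[0, 1, 2, 4],
     [1, 0, 1, 3],
     [2, 1, 0, 2],
     [4, 3, 2, 0]]

best_fit, best_coph, fits = permuted_haca(D)
print("distinct fit values across input orders:", sorted(fits))
print("best fit:", round(best_fit, 9))
```

In this toy matrix the ties occur both in the initial proximities and during agglomeration, so different input orders can produce different trees with different fit values; the loop simply keeps one with the highest fit.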