Instability of Hierarchical Cluster Analysis Due to Input Order of the Data: The PermuCLUSTER Solution
Author: | Alexander M. J. Spaans, Willem A. van der Kloot, Willem J. Heiser |
---|---|
Year of publication: | 2005 |
Subject: |
Hierarchy (mathematics); Economies of agglomeration; Models, Psychological; Row and column spaces; Instability; Hierarchical clustering; Matrix (mathematics); Goodness of fit; Data Interpretation, Statistical; Statistics; Cluster (physics); Cluster Analysis; Humans; Psychology (miscellaneous); Algorithm; Mathematics |
Source: | Psychological Methods. 10:468-476 |
ISSN: | 1939-1463 1082-989X |
DOI: | 10.1037/1082-989x.10.4.468 |
Description: | Hierarchical agglomerative cluster analysis (HACA) may yield different solutions under permutations of the input order of the data. This instability is caused by ties, either in the initial proximity matrix or arising during agglomeration. The authors recommend repeating the analysis on a large number of random permutations of the rows and columns of the proximity matrix and selecting a solution with the highest goodness of fit. This approach was implemented in an SPSS add-in, PermuCLUSTER, which can perform all HACA methods of SPSS. Analyses of 2 data sets show that (a) results are affected by input order, (b) instability in one method co-occurs with instability in other methods, and (c) some instability effects are more dramatic because they occur at higher agglomeration levels. |
Database: | OpenAIRE |
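
The permutation strategy summarized in the description can be illustrated with a minimal Python sketch. This is not PermuCLUSTER, which is an SPSS add-in. It assumes SciPy's `linkage` as a stand-in for the SPSS HACA methods and uses the cophenetic correlation as the goodness-of-fit measure, which is an assumption made here because the record does not say which measure the authors use. The function name `permuted_haca` and its parameters are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet
from scipy.spatial.distance import squareform


def permuted_haca(dist, method="average", n_perm=200, seed=0):
    """Repeat HACA over random input orders and keep a best-fitting solution.

    dist   : square, symmetric proximity (distance) matrix with zero diagonal
    method : any linkage method accepted by scipy.cluster.hierarchy.linkage
    Returns (best_fit, best_perm, best_Z, all_fits).
    Goodness of fit is the cophenetic correlation (an assumption of this sketch).
    """
    rng = np.random.default_rng(seed)
    n = dist.shape[0]
    best_fit, best_perm, best_Z = -np.inf, None, None
    all_fits = np.empty(n_perm)

    for i in range(n_perm):
        # Permute rows and columns of the proximity matrix together.
        perm = rng.permutation(n)
        permuted = dist[np.ix_(perm, perm)]
        condensed = squareform(permuted)

        # Cluster the permuted data and score the resulting tree.
        Z = linkage(condensed, method=method)
        fit, _ = cophenet(Z, condensed)
        all_fits[i] = fit

        if fit > best_fit:
            best_fit, best_perm, best_Z = fit, perm, Z

    return best_fit, best_perm, best_Z, all_fits


if __name__ == "__main__":
    # Integer grid points with city-block distances produce many tied proximities.
    pts = np.array([[0, 0], [0, 1], [1, 0], [1, 1], [3, 3], [3, 4], [4, 3]], dtype=float)
    dist = np.abs(pts[:, None, :] - pts[None, :, :]).sum(axis=-1)

    best_fit, best_perm, best_Z, all_fits = permuted_haca(dist, n_perm=100)
    print("best fit:", best_fit)
    print("distinct fit values across input orders:", np.unique(np.round(all_fits, 6)).size)
```

Leaf indices in `best_Z` refer to positions in the permuted matrix, so `best_perm[k]` maps leaf `k` back to the original object. Whether the fit values actually differ across input orders depends on the data and on how the chosen implementation breaks ties.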